REMARKS 



Applicants had inadvertently renumbered the claims in the response to the 
second restriction requirement filed on 21 December, 2007. Applicants have herein 
corrected the error and have numbered the claims to reflect the claim numbers provided 
in the application as filed. Accordingly, pursuant to the entry of the instant amendment, 
Claims 1-5, 8-10, 19-23, 25-26, 32-40, 44-47, 56-60, 62-63, and 69-73 are pending; 
Claims 37-40, 44-47, 56-60, 62-63, and 69-73 are withdrawn; and Claims 6, 7, 11-18, 
24, 27-31, 41-43, 48-55, 61, 64-68 and 74-145 are cancelled without prejudice. Claiml 
was amended to recite that the target moiety is a carbohydrate, which was elected 
pursuant to the second restriction requirement mailed on 1 1/26/2007. The pending 
dependent claims have been amended to restore the dependency to the original claim 
numbers. Withdrawn Claim 37 was amended to correspond to the amendment made in 
Claim 1, and the remaining withdrawn dependent claims were amended to restore the 
dependency to the original claim numbers. 

I. Amendments to the Specification 

In response to the Examiner's objection that the recitation of residue numbers in Claim 
10 render the claims indefinite "as it is unclear what these amino acids residues are 
referred to as no amino acid sequence is presented in the specification either by SEQ ID 
number or by accession number" (see page 3 of the Office Action), Applicants have 
amended the specification to address the rejection under 35 USC 112, second 
paragraph. 

II. Sequence Compliance 

Applicants have amended the specification to recite the Accession numbers of 
the Bacillus amyloliquefaciens subtilisin being CAA24990 (Appendix 1), and the 
Accession number of the Bacillus lentus subtilisin (SBL) (Appendix 2) as being P29600. 
The Bacillus lentus subtilisin is used interchangeably with GG36. For example, the 
disclosure cites at page 46, lines 17-20 the U.S. patent 5,185,258, which refers to the 
gene encoding the Bacillus lentus subtilisin (herein referred to as SBL) as GG36 (see for 
example Examples 2 and 3 of U.S. patent 5,185,258 at column 14, line 51 to column 16, 
line 47 _ Appendix 3), and refers to the product of the Bacillus lentus subtilisins gene 
product as GG36. For example see column 15, lines 44-46 which refer to the mature 
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gene product as GG36. Applicants submit that the recitation of the protein accession 
numbers renders moot the request for a sequence listing. 

III. Claim Objections 

The Examiner objected to Claims 1-5, 8-10, and 19-36 for containing non-elected 
subject matter. 

Accordingly, applicants have amended Claim 1 to recite "a carbohydrate" as 
being the target moiety elected in response to the second restriction requirement a 
mailed on 11/26/2007. The amendment to Claim 1 renders moot the objection of 
dependent claims 2-5, 8-10, and 19-36. 

IV. Claim Rejections under 35 U.S.C. §112, second paragraph 

The Examiner has rejected Claim 10 under 35 U.S.C. §112, second paragraph 
for being "unclear what these amino acid residues are referred to as no amino acid 
sequence is presented in the specification either by SEQ ID number or by accession 
number". 

As discussed above, Applicants have amended the specification to recite protein 
accession numbers for the Bacillus lentus subtilisin (SBL) as Protein Accession Number 
P29600, and Claims 1 , 10 and 47 have been amended accordingly to recite the B. lentus 
subtilisin accession number. 

Therefore, the reference residues recited in the claims are no longer undefined. 

In light of the foregoing, Applicants respectfully request that the rejection of Claim 
10 under 35 U.S.C. §112, second paragraph be withdrawn. 

V. Claim Rejections under 35 U.S.C. §112, first paragraph 

(A) The Examiner has rejected Claims 1-5,8-10,1 2-29 (now numbered 1 9- 
36 as originally filed) under 35 U.S.C. §112, first paragraph as containing subject matter 
which was not described in the specification in such a way as to reasonably convey to 
one skilled in the relevant art that the inventor(s), at the time the application was filed, 
had possession of the claimed invention. 

In particular, the Examiner states that "these claims are directed to a genus of 
catalytic antagonists comprising chimeric proteins comprising any subtilisins-type serine 
hydrolase conjugated through the sulfur group on a cysteine of said hydrolase with a 
genus of targeting moiety". Further, the Examiner states that "the specification teaches 
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the structure of only a few such chimeric proteins", and that "the specification fails to 
describe any other representative species by any identifying characteristics other than 
the functionality of [the] chimeric protein". 

Applicants respectfully traverse the rejection for the following reasons. 

First, Applicants have amended Claim 1 to recite that the targeting moiety is a 
carbohydrate. Therefore, Claim 1 as amended is specifically directed to subtilisin type 
serine proteases that comprise a carbohydrate targeting moiety, and not to any targeting 
moiety. Second, in addition to teaching the functionality of the chimeric proteins, the 
specification also teaches the functionality of the subtilisin serine proteases. For 
example, at page 13, line 32 to page 14, Iine16, the disclosure provides numerous 
references and teaches that " the "subtilisin type serine proteases" refer to a family of 
serine hydrolyases based on structural homology to enzymes derived from Bacillus 
subtilus, including subtilisin BPN' (Bott era/. (1988) J. Biol. Chem. 263: 7895-7906; 
Siezen and Louise (1997) Protein Science 6: 501-523; Bartlett and Rawlings (1994) 
Meth. Enzymol., 244: 19-61 , Academic Press, S.D.). " The specification further teaches 
the structural characteristics that discern the subtilisin serine protease from the 
chymotrypsin serine proteases, and discloses that while the amino acid sequences of 
the subtilisin serine proteases are not entirely homologous, they "exhibit the same or 
similar type of proteolytic activity". The specification also discloses specific residues of 
subtilisins type serine proteases that can be substituted and modified to create the 
claimed catalytic antagonists (for example, see page 42, line 8 to page 43, line 7, and 
page 44, line 12 to page 46, line 2, and Examples 1-12). Furthermore, the specification 
also discloses the targeting moieties that can be conjugated to the modified antagonists 
(for example see page 49, line 17 to page 50, line 20 and Examples 1-12). 

Thus, the specification provides adequate description of the claimed catalytic 
antagonist comprising a targeting moiety that specifically binds to said target molecule 
said targeting moiety being a carbohydrate.attached to an enzyme, said enzyme being a 
subtilisin-type serine protease" as recited in Claim 1. Applicants submit that the 
specification sufficiently describes the claimed invention, and shows that Applicants 
were indeed in possession of the claimed invention at the time the application was filed. 

In light of the foregoing, Applicants respectfully request that the rejection of 
Claims 1-5, 8-10, 12-29 (now numbered 19-36 as originally filed) under 35 U.S.C. §112, 
first paragraph, be withdrawn. 
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(B) The Examiner has rejected Claims 1 -5, 8-10,1 2-29 (now numbered 1 9- 
36 as originally filed) under 35 U.S.C. §112, first paragraph, "because the specification , 
while being enabling for [a] catalytic antagonist comprising [a] chimeric protein of a 
Bacillus lentus subtilisin-type serine hydrolase conjugated with [a] mannose type 
oligosaccharide... to target mannose binding lectin... [does] not reasonably provide 
enablement for any catalytic antagonist comprising [a] chimeric protein comprising any 
subtilisins type serine hydrolase with a genus of targeting moiety comprising any 
structural and functional features". 

Applicants respectfully traverse the rejection for the following reasons. 

As discussed above, Claim 1 has been amended to recite "carbohydrate" as 
being the targeting moiety. The specification teaches which residues to select for 
substitution and conjugation, and discloses specific sites in B. lentus subtilisins and 
other related enzymes (see for example page 42, line 24 to page 43, line 2). Methods 
for introducing the cysteine residue in the serine hydrolase are described at least at 
page 44, line 12 to page 46, line 2. The disclosure also teaches methods for coupling 
the targeting moieties to the cysteine (see page 49, line 16 to page 50, line 20), and 
provides specific examples relating to the synthesis of carbohydrate modified serine 
hydrolases (Example 5 at page 81), and the use of a mannosylated serine hydrolase to 
target mannose binding lectin (Example 6 at page 82). 

Therefore, the specification does teach a person skilled in the art to make and 
use the claimed invention. 

In light of the foregoing, Applicants respectfully request that the rejection of 
Claims 1-5, 8-10, 12-29 (now numbered 19-36 as originally filed) under 35 U.S.C. §112, 
first paragraph for lacking enablement, be withdrawn. 
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CONCLUSION 

Applicants submit that this paper fully addresses the Office Action mailed March 
20, 2008. Applicants believe the pending claims are in condition for allowance and 
issuance of a formal Notice of Allowance at an early date is respectfully requested. If a 
telephone conference would expedite prosecution of this application, the Examiner is 
invited to telephone the undersigned at (650) 846-7636. 

This paper is accompanied by a request for Extension of Time under 37 C.F.R. 
1 .136(a) of THREE months extending the time for response to September 20, 2008. 
This response is filed prior to the extended deadline and is therefore timely filed. The 
Commissioner is authorized to charge any fees that may be required in connection with 
this submission and to credit any overpayments to Deposit Account No. 07-1048 
(Attorney Docket No. GC571-2-C1). 

Respectfully submitted, 

Date: September 19, 2008 /Elena E. Quertermous/ 

Elena E. Quertermous 
Registration No. 47,873 

Danisco U.S., Genencor Division 
925 Page Mill Road 
Palo Alto, CA 94304 
Tel: 650-846-7636 
Fax: 650-845-6504 
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" " " " " ^Protein rIU 



•••••• 

Structure PMC 



Search (Protein" 



"3 for[ 



Display jGenPept 3 Show|5 3| Sendto -lJ 

Range: from [begin to [end Features: T CDD Refresh I 



ri: CAA24990. Reports precursor subtili...[gi:773560] 
Comment Features Sequence 



BLink, Conserved 
Domains, Links 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 
KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 

FEATURES 

source 



CAA24990 376 aa linear BCT 17-NOV-2004 

precursor subtilisin [Bacillus amyloliquef aciens ] . 
CAA24 990 

CAA24990.1 GI:773560 
embl accession X00165.1 



Bacillus amyloliquefaciens 
Bacillus amyloliquefaciens 

Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 
1 (residues 1 to 376) 

Wells, J. A., Ferrari, E., Henner,D.J., Estell,D.A. and Chen, E.Y. 
Cloning, sequencing, and secretion of Bacillus amyloliquefaciens 
subtilisin in Bacillus subtilis 
Nucleic Acids Res. 11 (22), 7911-7925 (1983) 

6316278 

On Apr 17, 1995 this sequence version replaced gi: 762917. 
Data kindly reviewed (01-JUL-1985) by D.A. Estell. 
Location/Qualifiers 
1. .376 

/organism="Bacillus amyloliquefaciens" 
/db_xref="taxon: 1390" 
Protein 1 . . 376 

/product ="precursor subtilisin" 
sig_peptide 1 . . 27 

/note="coding sequence signal peptide" 
CDS 1..376 

/coded_by="X00165. 1:111. . 1241" 
/note="tgg start" 
/transl_table=ll 
/db_xref="GOA:P00782" 
/db_xref="PDB: 1A2Q" 
/db_xref="PDB: 1AK9" 
/ db_xref="PDB : 1AQN" 
/db_xref="PDB: 1AU9" 
/db_xref="PDB: 1DUI" 
/db_xref="PDB: 1GNS" 
/db_xref="PDB: 1GNV" 
/db_xref="PDB: 1LW6" 
/db_xref="PDB: 1S01" 
/db_xref="PDB: 1S02" 
/db_xref="PDB: 1SBH" 
/db_xref="PDB: 1SBI" 
/db_xref="PDB: ISBN" 
/db xref="PDB: 1SBT" 
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/db_xref="PDB: 1SIB" 
/db_xref="PDB: 1SPB" 
/db_xref="PDB:lST2" 
/db_xref="PDB: 1SUA" 
/db_xref="PDB: 1SUB" 
/db_xref="PDB: 1SUC" 
/db_xref="PDB: 1SUD" 
/db_xref="PDB: 1SUE" 
/db_xref="PDB: 1SUP" 
/db_xref="PDB: 1TM1" 
/db_xref="PDB:lTM3" 
/db_xref="PDB: 1TM4" 
/db_xref="PDB: 1TM5" 
/db_xref="PDB: 1TM7" 
/db_xref="PDB: 1TMG" 
/db_xref="PDB: 1T01" 
/db_xref="PDB: 1T02" 
/db_xref="PDB: 1UBN" 
/db_xref="PDB: 1V5I" 
/db_xref="PDB: 1YJA" 
/db_xref="PDB: 1YJB" 
/db_xref="PDB: 1YJC" 
/db_xref="PDB:2SBT" 
/db_xref="PDB:2SIC" 
/db_xref="PDB:2SNI" 
/db_xref="PDB:2STl" 
/db_xref="PDB: 3SIC" 
/db_xref="PDB:5SIC" 

/db_xref="UniProtKB/Swiss-Prot: P00782" 

I 

1 misllfalal iftmafgsts saqaagksng ekkyivgfkq tmstmsaakk kdvisekggk 
61 vqkqfkyvda asatlnekav kelkkdpsva yveedhvaha yaqsvpygvs qikapalhsq 
121 gytgsnvkva vidsgidssh pdlkvaggas mvpsetnpfq dnnshgthva gtvaalnnsi 
181 gvlgvapsas lyavkvlgad gsgqyswiin giewaiannm dvinmslggp sgsaalkaav 
241 dkavasgvvv vaaagnegts gssstvgypg kypsviavga vdssnqrasf ssvgpeldvm 
301 apgvsiqstl pgnkygayng tsmasphvag aaalilskhp nwtntqvrss lentttklgd 
361 sfyygkglin vqaaaq 
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•1 Protein \$wm ^ 



Search | Protein 



forf 



Display jGenPept "Tj Show F"Z3| Sendt0 _zJ 

Range: from [begin j to |end Features: T CDD +j Refre sh j 

H1: P29600. Reports Subtilisin Savina...[gi:267048] 
Comment Features Sequence 



BLink, Conserved 
Domains, Links 



269 aa linear BCT 15-JAN-2008 

(Alkaline protease) . 



LOCUS P29600 
DEFINITION Subtilisin Sa' 
ACCESSION P29600 
VERSION P29600.1 GI:267048 

DBSOURCE swissprot: locus SUBS_BACLE, accession P2 9600; 
class: standard, 
created: Apr 1, 1993. 
sequence updated: Apr 1, 1993. 
annotation updated: Jan 15, 2008. 

xrefs: 1C9J, 1C9M, 1C9N, 1GCI, 1IAV, 1JEA, 1NDQ, 1NDU, 1Q5P, 1SVN, 
1TK2 

xrefs (non-sequence databases): PDBsum : 1C9J, PDBsum: 1C9M, 
PDBsum:lC9N, PDBsum: 1GCI, PDBsum: 1IAV, PDBsum: 1JEA, PDBsum: 1NDQ, 
PDBsum: 1NDU, PDBsum: 1Q5P, PDBsum: 1SVN, PDBsum: 1TK2, LinkHub : P2 9600 , 
GO:0004291, InterPro : I PR000209, InterPro: IPR015500, 
Gene3D:G3DSA: 3. 40.50.200, PANTHER: PTHR10795, Pf am: PF00082, 
PRINTS: PR00723, PROSITE : PS001 36, PROSITE : PS00137 , PROSITE : PS00138 

KEYWORDS 3D-structure; Calcium; Hydrolase; Metal-binding; Protease; 
Secreted; Serine protease; Sporulation. 

SOURCE Bacillus lentus 

ORGANISM Bacillus lentus 

Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

REFERENCE 1 (residues 1 to 269) 

AUTHORS Betzel,C, Klupsch,S., Papendorf , G . , Hastrup,S., Branner,S. and 
Wilson, K.S. 

TITLE Crystal structure of the alkaline proteinase Savinase from Bacillus 

lentus at 1.4 A resolution 
JOURNAL J. Mol. Biol. 223 (2), 427-445 (1992) 

PUBMED 1738156 
REMARK X-RAY CRYSTALLOGRAPHY (1.4 ANGSTROMS). 
REFERENCE 2 (residues 1 to 269) 

AUTHORS Remerowski,M.L. , Pepermans , H . A. , Hilbers,C.W. and Van De Ven,F.J. 
TITLE Backbone dynamics of the 269-residue protease Savinase determined 

from 15N-NMR relaxation measurements 
JOURNAL Eur. J. Biochem. 235 (3), 629-640 (1996) 

PUBMED 8654411 
REMARK STRUCTURE BY NMR. 
REFERENCE 3 (residues 1 to 269) 

AUTHORS Kuhn,P., Knapp,M., Soltis,S.M., Ganshaw,G., Thoene,M. and Bott,R. 
TITLE The 0.78 A structure of a serine protease: Bacillus lentus 

subtilisin 

JOURNAL Biochemistry 37 (39), 13446-13452 (1998) 

PUBMED 97534 30 
REMARK X-RAY CRYSTALLOGRAPHY (0.78 ANGSTROMS). 
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FEATURES 

source 



[FUNCTION] Subtilisin is an extracellular alkaline serine protease, 
it catalyzes the hydrolysis of proteins and peptide amides. 

[CATALYTIC ACTIVITY] Hydrolysis of proteins with broad specificity 
for peptide bonds, and a preference for a large uncharged residue 
in PI. Hydrolyzes peptide amides. 

[COFACTOR] Binds 2 calcium ions per subunit . 

[SUBCELLULAR LOCATION] Secreted. 

[BIOTECHNOLOGY] Used as a detergent protease. Sold under the name 
Savinase by Novozymes. 

[MISCELLANEOUS] Secretion of subtilisin is associated with onset of 
sporulation, and many mutations which block sporulation at early ^ 
stages affect expression levels of subtilisin. However, subtilisin 
is not necessary for normal sporulation. 
[SIMILARITY] Belongs to the peptidase S8 family. 

Location/Qualifiers 

1. .269 

/organism="Bacillus lentus" 
/db_xref="taxon: 14 67" 
i 1..269 

/product="Subtilisin Savinase" 
/EC_number="3 . 4 . 21 . 62" 
1. .269 

/region_name="Mature chain" 
/experiment="experimental < 
recorded" 

/note="Subtilisin Savinase. 



vidence, no additional details 
/FTId=PRO 0000076418." 



/site_type="metal-binding" 
/experiment="experimental evidence, 
recorded" 

/note="Calcium 1." 
6. .10 

/region_name="Helical region" 
/experiment=" experimental evidence, 
recorded" 
13. . 18 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 

recorded" 

27. .33 

/region_name=" Beta-strand region" 
/experiment="experimental evidence, 
recorded" 

32 

/site_type="active" 

/experiment="experimental evidence, 
recorded" 

/note="Charge relay system." 
40 

/site_type="metal-binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 1." 
43. . 48 

/region_name= "Beta-strand region" 

/experiment="experimental evidence, no additional details 

recorded" 

59. . 61 

/region_name="Beta-strand region" 

/experiment="experimental evidence, no additional details 



additional details 



no additional details 



no additional details 



no additional details 
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recorded" 
62. .71 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 

recorded" 

62 

/site_t ype=" active" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Charge relay system." 
73 

/site_type="metal-binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 1 (via carbonyl oxygen)." 
75. .78 

/region_name="Beta-strand region" 

/experiment="experimental evidence, no additional details 

recorded" 

75 

/site_type= "metal-binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 1." 
77 

/ s it e_type= "metal -binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 1 (via carbonyl oxygen)." 
79 

/site_type="metal-binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 1 (via carbonyl oxygen)." 
87. .92 

/region_name="Beta-strand region" 

/experiment="experimental evidence, no additional details 

recorded" 

102. .114 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 

recorded" 

118. .122 

/region_name=" Beta-strand region" 

/experiment="experimental evidence, no additional details 

recorded" 

126. . 128 

/region_name="Beta-strand region" 

/experiment="experimental evidence, no additional details 

recorded" 

131. .142 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 

recorded" 

146. . 150 

/region_name="Beta-strand region" 

/experiment="experimental evidence, no additional details 

recorded" 

162. .164 

/region_name="Hydrogen bonded turn" 
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/experiment="experimental evidence, no additional details 
recorded" 
Site 163 

/site type="metal-binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 2 (via carbonyl oxygen)." 
Site 165 

/site_type="metal-binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 2 (via carbonyl oxygen)." 
Region 168.. 174 

/region_name=" Bet a- strand region" 

/experiment="experimental evidence, no additional details 
recorded" 
Site 168 

/site__type="metal-binding" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Calcium 2 (via carbonyl oxygen)." 
Region 190.. 195 

/region_name="Beta-strand region" 

/experiment="experimental evidence, no additional details 
recorded" 
Region 197.. 203 

/region_name=" Beta -strand region" 

/experiment="experimental evidence, no additional details 
recorded" 
Region 204.. 206 

/region_name="Hydrogen bonded turn" 

/experiment="experimental evidence, no additional details 
recorded" 
Region 207.. 211 

/region_name= "Beta-strand region" 

/experiment="experimental evidence, no additional details 
recorded" 
Region 214.. 231 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 
recorded" 

Site 215 

/ si te_type=" active" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="Charge relay system." 
Region 237.. 246 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 
recorded" 
Region 254.-257 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 
recorded" 
Region 264.-267 

/region_name="Helical region" 

/experiment="experimental evidence, no additional details 
recorded" 

ORIGIN 

1 aqsvpwgisr vqapaahnrg Itgsgvkvav ldtgisthpd lnirggasfv pgepstqdgn 
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61 ghgthvagti aalnnsigvl gvapsaelya vkvlgasgsg svssiaqgle wagnngmhva 
121 nlslgspsps atleqavnsa tsrgvlvvaa sgnsgagsis yparyanama vgatdqnnnr 
181 asfsqygagl divapgvnvq stypgstyas lngtsmatph vagaaalvkq knpswsnvqi 
241 rnhlkntats lgstnlygsg lvnaeaatr 

// 
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1 AATTCTCCAT TTTCTTCTGC TATCAAAATA ACAGACTCGT GATTTTCCAA 
aprE promoter region 
51 ACGAGCTTTC AAAAAAGCCT CTGCCCCTTG CAAATCGGAT GCCTGTCTAT 
aprE promoter region 

Eagl 
Not I 

101 AAAATTCCCG ATATTGGTTA AACAGCGGCG CAATGGCGGC CGCATCTGAT 

aprE promoter region 
151 GTCTTTGCTT GGCGAATGTT CATCTTATTT CTTCCTCCCT CTCAATAATT 

aprE promoter region 
201 TTTTCATTCT ATCCCTTTTC TGTAAAGTTT ATTTTTCAGA ATACTTTTAT 

aprE promoter region 
251 CATCATGCTT TGAAAAAATA TCACGATAAT ATCCATTGTT CTCACGGAAG 

aprE promoter region 

3 01 CACACGCAGG TCATTTGAAC GAATTTTTTC GACAGGAATT TGCCGGGACT 

aprE promoter region 

351 CAGGAGCATT TAACCTAAAA AAGCATGACA TTTCAGCATA ATGAACATTT 

aprE promoter region 
401 ACTCATGTCT ATTTTCGTTC TTTTCTGTAT GAAAATAGTT ATTTCGAGTC 

aprE promoter region 

4 51 TCTACGGAAA TAGCGAGAGA TGATATACCT AAATAGAGAT AAAATCATCT 

aprE promoter region 

501 CAAAAAAATG GGTCTACTAA AATATTATTC CATCTATTAC AATAAATTCA 

aprE promoter region 
551 CAGAATAGTC TTTTAAGTAA GTCTACTCTG AATTTTTTTA AAAGGAGAGG 

AprE signal peptide 
aprE promoter region 

M RSKK L W I SLL FALT 
601 GTAAAGAGTG AGAAGCAAAA AATTGTGGAT CAGCTTGTTG TTTGCGTTAA 
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• LIP T M A FSNM SAQ ADD 
651 CGTTAATCTT TACGATGGCG TTCAGCAACA TGTCTGCGCA GGCTGATGAT 



Y S V V EEH GQL SISN GEL' 
701 TATTCAGTTG TAGAGGAACA TGGGCAACTA AGTATTAGTA ACGGTGAATT 



•VNE RGEQ VQL K G M SSHG 
751 AGTCAATGAA CGAGGCGAAC AAGTTCAGTT AAAAGGGATG AGTTCCCATG 



101 GTTTGCAATG GTACGGTCAA TTTGTAAACT ATGAAAGCAT GAAATGGCTA 



RDDW GIT VFR A A M Y T S S ■ 
851 AGAGATGATT GGGGAATAAC TGTATTCCGA GCAGCAATGT ATACCTCTTC 



GGY IDDP S V X EKV KETV 
901 AGGAGGATAT ATTGACGATC CATCAGTAAA GGAAAAAGTA AAAGAGACTG 



■ E A A IDL GIYV IID WHI 
951 TTGAGGCTGC GATAGACCTT GGCATATATG TGATCATTGA TTGGCATATC 



b 3 O M D P N IYK E3AK D F F ■ 
1001 CTTTCAGACA ATGACCCGAA TATATATAAA GAAGAAGCGA AGGATTTCTT 



•DEM SELY GDY PNV IYEI 
1051 TGATGAAATG TCAGAGTTGT ATGGAGACTA TCCGAATGTG ATATACGAAA 



. TTGCAAATGA ACCGAATGGT AGTGATGTTA CGTGGGACAA TCAAATAAAA 



P Y A E EVI PVI RDND PNN 
1151 CCGTATGCAG AAGAAGTGAT TCCGGTTATT CGTGACAATG ACCCTAATAA 
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■IVI VGTG TWS QDV H H A A ■ 
1201 CATTGTTATT GTAGGTACAG GTACATGGAG TCAGGATGTC CATCATGCAG 



• DNQ LAD PNVM YAF H F V 
1251 CCGATAATCA GCTTGCAGAT CCTAACGTCA TGTATGCATT TCATTTTTAT 



AGTH GQN LRD QVDY A L D 
1301 GCAGGAACAC ATGGACAAAA TTTACGAGAC CAAGTAGATT ATGCATTAGA 



• Q G A AIFV SEW GTS A A T G • 
13 51 TCAAGGAGCA GCGATATTTG TTAGTGAATG GGGGACAAGT GCAGCTACAG 



DERN LSW ANW SLTH K D E ■ 
1451 GATGAAAGAA ATTTAAGCTG GGCCAACTGG TCTCTAACGC ATAAGGATGA 



1501 GTCATCTGCA GCGTTAATGC CAGGTGCAAA TCCAACTGGT GGTTGGACAG 



• A E L SPS GTFV R E K IRE 
1551 AGGCTGAACT ATCTCCATCT GGTACATTTG TGAGGGAAAA AATAAGAGAA 



1st CBD Linker 



1601 TCAGCATCTA TTCCGCCAAG CGATCCAACA CCGCCATCTG ATCCAGGAGA 



fusion site 



1st CBD Linker 



BamHI Sac I 



•PDP DDES SKP CCD Q C A C • 
1651 ACCGGATCCA GACGATGAGA GCTCTAAACC CTGTTGCGAT CAATGCGCAT 
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FIGURE 1. (cont.) 



1701 GTACGAAATC AAATCCTCCA CAGTGTCGGT GTTCCGATAT GCGTCTGAAT 



1751 AGCTGTCATA GTGCATGCAA AAGCTGTATC TGCGCCCTGA GTTATCCAGC 



•QCF CVDI TDF C Y E PCKP 
1801 TCAATGTTTT TGCGTCGACA TCACGGACTT CTGCTATGAG CCATGTAAAC 



■ SED DKE NHH HHHH Stop (SEQ ID NO: 2) 
1851 CAAGCGAGGA CGATAAAGAG AACCATCATC ACCATCACCA TTAAAAGTTA 

IJ\T terminator 



1901 ACAGAGGACG GATTTCCTGA AGGAAATCCG TTTTTTTATT TTTAAGCTTG (SEQ ID NO:l) 
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FIGURE 3. 



12BBIck81 



BamHl SacI 



DPD DESS KPC CDQ C A C Y • 
1 GGATCCAGAC GATGAGAGCT CTAAACCCTG TTGCGATCAA TGCGCATGTT 
CCTAGGTCTG CTACTCTCGA GATTTGGGAC AACGCTAGTT ACGCGTACAA 

12BBICk81 



PstI ECORI 



• NLY GWT CRCS DMR LNS 
51 ATAATTTGTA TGGGTGGACT TGTCGCTGCA GCGATATGCG TCTGAATTCC 
TATTAAACAT ACCCACCTGA ACAGCGACGT CGCTATACGC AGACTTAAGG 

12BBIck81 



CHSA CKS CAC YNLY GWT 
101 TGTCATAGTG CCTGCAAAAG CTGCGCATGT TATAACCTGT ACGGGTGGAC 
ACAGTATCAC GGACGTTTTC GACGCGTACA ATATTGGACA TGCCCACCTG 

12BBIck81 



Sail 

•CFC VDIT DFC YEP CKPS- 

151 CTGTTTTTGC GTCGACATCA CGGACTTCTG CTATGAGCCA TGTAAACCAA 

GACAAAAACG CAGCTGTAGT GCCTGAAGAC GATACTCGGT ACATTTGGTT 

12BBICk81 



• E D D KEN * (SEQ ID NO: 4) 
201 GCGAGGACGA TAAAGAGAAC TAA (SEQ ID NO: 3) 
CGCTCCTGCT ATTTCTCTTG ATT 
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| □ Active BB1 II Inactive BBI 

200 i 




37 C 0.2 mM 1 mM 5 mM Cys - 1 Glut- 1 DTT- 1 
BME (1X) BME mM mM mM 

* ratio of BCE : BBI BME 



FIGURE 5. 
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18000 




0 5 10 15 20 25 

binder [uM] 
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FIGURE 7. 

hiPDI 



AprE signal cleavage site 



BssHU Nhel BsrGl 



SAQA SDV VQL KKD T F D D 
1 AGCGCGCAGG CTAGCGATGT TGTACAACTG AAAAAA GflCA CTTTCGACGA 
TCGCGCGTCC GATCGCTACA ACATGTTGAC TTTTTTCTGT GAAAGCTGCT 

hiPDI 



. F I K TNDL V L A E F F A P W C • 
51 CTTCATCAAA ACAAATGACC TTGTTCTTGC TGAATTTTTC GCGCCGTGGT 
GAAGTAGTTT TGTTTACTGG AACAAGAACG ACTTAAAAAG CGCGGCACCA 

hiPDI 



G H C KAL APEY EEA ATT 

101 GCGGTCACTG CAAAGCTCTT GCTCCTGAGT ACGAGGAAGC TGCAACTACA 

CGCCAGTGAC GTTTCGAGAA CGAGGACTCA TGCTCCTTCG ACGTTGATGT 

hiPDI 



LKEK NIK LAK VDCT EET 
151 CTGAAAGAAA AGAACATCAA ACTTGCTAAA GTAGACTGCA CAGAAGAGAC 
GACTTTCTTT TCTTGTAGTT TGAACGATTT CATCTGACGT GTCTTCTCTG 

hiPDI 



•DLC QQHG VEG YPT L K V F • 
2 01 TGATCTTTGC CAACAACATG GTGTTGAGGG CTACCCAACT CTTAAAGTTT 
ACTAGAAACG GTTGTTGTAC CACAACTCCC GATGGGTTGA GAATTTCAAA 

hiPDI 



. RGL DNV SPYK GQR KAA 
251 TCCGTGGCCT TGACAACGTA TCTCCTTACA AAGGTCAACG TAAAGCTGCT 
AGGCACCGGA ACTGTTG CAT AGAGGAATGT TTCCAGTTGC ATTTCGACGA 

hiPDI 



AITS YMI KQS LPAV SEV 
3 01 GCAATCACTT CATACATGAT CAAACAATCT CTGCCTGCTG TATCTGAAGT 
CGTTAGTGAA GTATGTACTA GTTTGTTAGA GACGGACGAC ATAGACTTCA 

hiPDI 



• TKD NLEE FKK ADK A V L V - 
351 TACAAAAGAC AACCTTGAAG AATTTAAAAA AGCTGACAAA GCTGTTCTTG 
ATGTTTTCTG TTGGAACTTC TTAAATTTTT TCGACTGTTT CGACAAGAAC 

hiPDI 



A Y V DAS DKAS SEV FTQ 
4 01 TTGCTTATGT AGATGCTTCT GACAAAGCAT CTAG CGAAGT TTTCACTCAA 
AACGAATACA TCTACGAAGA CTGTTTCGTA GATCGCTTCA AAAGTGAGTT 

hiPDI 



VAEK LRD NYP FGSS SDA 
4 51 GTTGCTGAAA AACTGCGCGA TAACTACCCA TTCGGCTCTA GCTCTGATGC 
CAACGACTTT TTGACGCGCT ATTGATGGGT AAGCCGAGAT CGAGACTACG 
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FIGURE 7 (cont.) 

hiPDI 



•ALA EAEG V K A PAI V L Y K 
501 TGCACTGGCT GAAGCTGAGG GCGTTAAAGC ACCTGCTATT GTTCTTTACA 
ACGTGACCGA CTTCGACTCC CGCAATTTCG TGGACGATAA CAAGAAATGT 

hiPDI 



• 0 F D EGK A V F S EKF EVE 
551 AAGACTTTGA TGAAGGTAAA GCGGTTTTCT CTGAAAAATT CGAAGTAGAG 
TTCTGAAACT ACTTCCATTT CGCCAAAAGA GACTTTTTAA GCTTCATCTC 

hiPDI 



AIEK FAK TGA TPLI GEI 
601 GCAATC GAAA AATTCGCTAA AACAGGTGCT ACTCCACTTA TTGGCGAAAT 
CGTTAGCTTT TTAAGCGATT TTGTCCACGA TGAGGTGAAT AACCGCTTTA 

hiPDI 



•GPE TYSD YMS A G I PLAY- 
651 CGGACCTGAA ACTTACTCTG ATTACATGTC AGCTGGCATC CCTCTGGCAT 
GCCTGGACTT TGAATGAGAC TAATGTACAG TCGACCGTAG GGAGACCGTA 

hiPDI 



• I F A ETA EERK ELS DKL 
701 ACATTTTCGC T GAAA CAGCT GAAGAGCGTA AAGAACTCAG CGACAAACTT 
TGTAAAAGCG ACTTTGTCGA CTTCTCGCAT TTCTTGAGTC GCTGTTTGAA 

hiPDI 



KPIA EAQ RGV INFG TID 
751 AAACCAATCG CTGAAGCTCA ACGTGGCGTT ATTAACTTTG GTACTATTGA 
TTTGGTTAGC GACTTCGAGT TGCACCGCAA TAATTGAAAC CATGATAACT 

hiPDI 



A K A FGAH AGN LNL KTDK- 
801 CGCTAAAGCA TTTGGTGCTC ACGCTGGAAA CCTGAATCTG AAAACTGACA 
GCGATTTCGT AAACCACGAG TGCGACCTTT GGACTTAGAC TTTTGACTGT 

hiPDI 



• FPA FAI QEVA KNQ KFP 
851 AATTCCCTGC TTTCGCAATC CAAGAAGTTG CTAAAAACCA AAAATTCCCT 
TTAAGGGACG AAAGCGTTAG GTTCTTCAAC GATTTTTGGT TTTTAAGGGA 

hiPDI 



FDQE KEI TFE A I K A FVD 
901 TTTGATCAAG AAAAA GAAAT TACTTTTGAA GCGATCAAAG CATTCGTTGA 
AAACTAGTTC TTTTTCTTTA ATGAAAACTT CGCTAGTTTC GTAAGCAACT 

hiPDI 



' D F V AGKI EPS IKS EPIP 
951 CGATTTTGTT GCTGGTAAAA TCGAACCAAG CATCAAATCA GAACCAATCC 
GCTAAAACAA CGACCATTTT AGCTTGGTTC GTAGTTTAGT CTTGGTTAGG 

hiPDI 



■ EKQ EGP VTVV VAK NYN 
1001 CT GAAAAA CA AGAAGGTCCT GTTACTGTAG TTGTAGCTAA AAACTACAAT 
GACTTTTTGT TCTTCCAGGA CAATGACATC AACATCGATT TTTGATGTTA 
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FIGURE 7. (cont.) 



hiPDI 



EIVL DDT KDV LIEF YAP 
1051 GAAATCGTTC TGGACGATAC TAAAGATGTA TTAATTGAAT TTTACGCTCC 
CTTTAGCAAG ACCTGCTATG ATTTCTACAT AATTAACTTA AAATGCGAGG 

hiPDI 



•WCG HCKA LAP KYE ELGA- 
1101 TTGGTGCGGT CACTGCAAAG CTCTXGCTCC TAAATACGAA GAACTTGGTG 
AACCACGCCA GTGACGTTTC GAGAACGAGG ATTTATGCTT CTTGAACCAC 

hiPDI 



• L Y A KSE FKDR VVI AKV 
1151 CTCTGTATGC AAAAAGCGAG TTCAAAGACC GTGTTGTAAT TGCTAAAGTT 
GAGACATACG TTTTTCGCTC AAGTTTCTGG CACAACATTA ACGATTTCAA 

hiPDI 



DATA NDV PDE IQGF PTI 
1201 GATGCAACAG CTAACGATGT TCCAGATGAA ATTCAAGGAT TCCCTACTAT 
CTACGTTGTC GATTGCTACA AGGTCTACTT TAAGTTCCTA AGGGATGATA 

hiPDI 



• K L Y PAGA KGQ PVT Y S G S • 
1251 CAAACTATAC CCAGCTGGTG CAAAAGGTCA ACCTGTTACT TACTCTGGTT 
GTTTGATATG GGTCGACCAC GTTTTCCAGT TGGACAATGA ATGAGACCAA 

hiPDI 



• RTV EDL IKFI AEN GKY 
1301 CACGCACTGT TGAAGACCTT ATCAAATTCA TTGCTGAAAA CGGTAAATAC 
GTGCGTGACA ACTTCTGGAA TAGTTTAAGT AACGACTTTT GCCATTTATG 

hiPDI 



Spel 



K A A I SED AEE TSSA TET 
1351 AAAGCTGCAA TCTCAGAAGA TGCTGAAGAG ACTAGTTCAG CAACTGAAAC 
TTTCGACGTT AGAGTCTTCT ACGACTTCTC TGATCAAGTC GTTGACTTTG 

hiPDI 



•TTE TATK SEE A A K E T A T • 
1401 AACTACAGAA ACTGCTACAA AGTCAGAAGA AGCTGCAAAA GAAACTGCAA 
TTGATGTCTT TGACGATGTT TCAGTCTTCT TCGACGTTTT CTTTGACGTT 

Enteropeptidaae cleavage linker 



hiPDI N-term BB1 



• EHD ELG SGSG DDD DKD 
1451 CAGAACACGA CGAACTTGGA TCTGGTTCCG GAGATGACGA TGACAAAGAC 
GTCTTGTGCT GCTTGAACCT AGACCAAGGC CTCTACTGCT ACTGTTTCTG 

N-term BBI 



SacI 



D E S S 
1501 GATGAGAGCT CT (SEQ ID NO: 5) 
CTACTCTCGA GA (SEQ ID NO: 6) 
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FIGURE 8. 

aprE promoter 



aprE promoter 



aprE promoter 



aprE promoter 



aprE promoter 



aprE promoter 



aprE promoter 



401 TACTCATGTC TATTTTCGTT CTTTTCTGTA TGAAAATAGT TATTTCGAGT 
ATGAGTACAG ATAAAAGCAA GAAAAGACAT ACTTTTATCA ATAAAGCTCA 

aprE promoter 



aprE promoter 



aprE promoter 



aprE promoter Cutinase signal peptide 



AprE signal peptide 
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FIGURE 8 (cont.) 



V RSK KLWI SLL F A L 
601 GGTAAAGAGT GAG AAG C AAA AAATTGTGGA TCAGCTTGTT GTTTGCGTTA 
CCATTTCTCA CTCTTCGTTT TTTAACACCT AGTCGAACAA CAAACGCAAT 

Cutinase 



Cutinase signal peptide 



T L A A SCL SVC A T V A A A P 
651 ACGCTGGCGG CCTCTTGCCT GTCCGTCTGT GCCACTGTCG CGGCGGCTCC 
TGCGACCGCC GGAGAACGGA CAGGCAGACA CGGTGACAGC GCCGCCGAGG 

Cutinase 



•LPD TPGA PFP A V A N F D R • 
701 CCTGCCGGAT ACACCGGGAG CGCCATTTCC GGCTGTCGCC AATTTCGACC 
GGACGGCCTA TGTGGCCCTC GCGGTAAAGG CCGACAGCGG TTAAAGCTGG 

Cutinase 



• SGP YTT SSQS EGP SCR 
751 GCAGTGGCCC CTACACCACC AGCAGCCAGA GCGAGGGGCC GAGCTGTCGC 
CGTCACCGGG GATGTGGTGG TCGTCGGTCT CGCTCCCCGG CTCGACAGCG 

Cutinase 



IYRP RDL GQG GVRH PVI 
801 ATCTATCGGC CCCGCGACCT GGGTCAGGC-G GGCGTGCGTC ATCCGGTGAT 
TAGATAGCCG GGGCGCTGGA CCCAGTCCCC CCGCACGCAG TAGGCCACTA 

Cutinase 



•LWG NGTG AGP STY A G L L • 
851 TCTCTGGGGC AATGGCACCG GTGCCGGGCC GTCCACCTAT GCCGGCTTGC 
AGAGACCCCG TTACCGTGGC CACGGCCCGG CAGGTGGATA CGGCCGAACG 

Cutinase 



■ SHW ASH GFVV AAA ETS 
901 TATCGCACTG GGCAAGCCAC GGTTTCGTGG TGGCGGCGGC GGAAACCTCC 
ATAGCGTGAC CCGTTCGGTG CCAAAGCACC ACCGCCGCCG CCTTTGGAGG 

Cutinase 



NAGT GRE MLA CLDY LVR 
951 AATGCCGGTA CCGGGCGGGA AATGCTCGCC TGCCTGGACT ATCTGGTACG 
TTACGGCCAT GGCCCGCCCT TTACGAGCGG ACGGACCTGA TAGACCATGC 

Cutinase 



•END TPYG TYS GKL NTGR- 
1001 TGAGAACGAC ACCCCCTACG GCACCTATTC CGGCAAGCTC AATACCGGGC 
ACTCTTGCTG TGGGGGATGC CGTGGATAAG GCCGTTCGAG TTATGGCCCG 

Cutinase 



• VGT SGH SQGG GGS IMA 
1051 GAGTCGGCAC TTCTGGGCAT TCCCAGGGTG GTGGCGGCTC GATCATGGCC 
CTCAGCCGTG AAGACCCGTA AGGGTCCCAC CACCGCCGAG CTAGTACCGG 

Cutinase 



GQDT RVR TTA PIQP YTL. 
1101 GGGCAGGATA CGAGGGTGCG TACCACGGCG CCGATCCAGC CCTACACCCT 
CCCGTCCTAT GCTCCCACGC ATGGTGCCGC GGCTAGGTCG GGATGTGGGA 



Cutinase 
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FIGURE 8 (cont.) 



•GLG HDSA SQR RQQ G P M F • 
1151 CGGCCTGGGG CACGACAGCG CCTCGCAGCG GCGGCAGCAG GGGCCGATGT 
GCCGGACCCC GTGCTGTCGC GGAGCGTCGC CGCCGTCGTC CCCGGCTACA 

Cutinase 



• LMS GGG DTIA FPY LNA 
1201 TCCTGATGTC CGGTGGCGGT GACACCATCG CCTTTCCCTA CCTCAACGCT 
AGGACTACAG GCCACCGCCA CTGTGGTAGC GGAAAGGGAT GGAGTTGCGA 

Cutinase 



QPVY RRA NVP VFWG ERR 
1251 CAGCCGGTCT ACCGGCGTGC CAATGTGCCG GTGTTCTGGG GCGAACGGCG 
GTCGGCCAGA TGGCCGCACG GTTACACGGC CACAAGACCC CGCTTGCCGC 

Cutinase 



•YVS HFEP VGS GGA Y R G P ■ 
13 01 TTACGTCAGC CACTTCGAGC CGGTCGGTAG CGGTGGGGCC TATCGCGGCC 
AATGCAGTCG GTGAAGCTCG GCCAGCCATC GCCACCCCGG ATAGCGCCGG 

Cutinase 



• STA WFR FQLM DDQ DAR 
1351 CGAGCACGGC ATGGTTCCGC TTCCAGCTGA TGGATGACCA AGACGCCCGC 
GCTCGTGCCG TACCAAGGCG AAGGTCGACT ACCTACTGGT TCTGCGGGCG 

Cutinase 



Alw44I 



ATFY GAQ CSL CTSL LWS 
14 01 GCTACCTTCT ACGGCGCGCA GTGCAGTCTG TGCACTTCTC TGCTTTGGTC 
CGATGGAAGA TGCCGCGCGT CACGTCAGAC ACGTGAAGAG ACGAAACCAG 

Linker 2 



Cutinase 



BamHX 



•VER RGLD NND PIP D 
14 51 TGTTGAACGC AGAGGTCTTG ACAACAATGA TCCTATTCCG GATCC 
ACAACTTGCG TCTCCAGAAC TGTTGTTACT AGGATAAGGC CTAGG 



(SEQ ID NO: 7) 
(SEQ ID NO: 8! 
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BACTERIAL EXPRESSION OF 
BOWMA VBIRK PROTEASE INHIBITORS 
AND VARIANTS THEREOF 

The present application claims priority under 35 U.S.C. § 
1 19. to co-pending U.S. Provisional Patent Application Ser. 
No. 60/518,154, tiled Nov. 6, 2003, co-pending U.S. Provi- 
sional Patent Application Ser. No. 60/520,403. filed Nov. 13, 
2003, co-pending U.S. Provisional Patent Application Ser. 
No. 60/530,954. filed Dec. 19, 2003, co-pending U.S. Provi- 
sional Patent \pplication Ser. No. 60. 5 1 1 .207. filed Dec. 19, 
2003. and co-pendinu U.S. Provisional Patent Application 
Ser. No. 60/531,189, filed Dec. 19. 2003. 

FIELD OF THE INVENTION 

The present invention provides compositions and methods 
related to expression of protease inhibitors and variants 
thereof in bacterial species. The present invention further 
provides fusion nucleic acids, vectors, fusion polypeptides, 
and processes for obtaining the protease inhibitors. 

BACKGROUND OF THE INVENTION 

Proteases are involved in a wide variety of biological pro- 
cesses. Disruption of the balance betv\ ecu proteases and pro- 
tease inhibitors is often associated with pathologic tissue- 
destruction. Indeed, various studies have focused on the role 
ol'proteases in (issue injury, and it is thought that the balance 
between proteases and protease inhibitors is a major determi- 
nant in maintaining tissue integrity. Serine proteases from 
inflammatory cells, including neutrophils, are implicated in 
various inflammatory disorders, such as pulmonary emphy- 
sema, arthritis, atopic dermatitis and psoriasis. 

Proteases also appear to function in the spread of certain 
cancers. Normal cells exist in contact with a complex protein 
network, called the extracellular matrix (ECM). The ECM is 
a barrier to cell movement and cancer cells must devise ways 
to break their attachments, degrade, and move through the 
ECM in order to metastasize. Proteases are enzymes that 
degrade other proteins and have long been thought to aid in 
freeing the tumor cells from their original location by chew - 
ing up the K( 'M. Recent studies have suggested that they may 
promote cell shape changes and motility through the activa- 
tion of a protein in the tumor cell membrane called Protease- 
Activated Receptor-2 (PAR2). This leads to a cascade of 
intracellular reactions that activates the motility apparatus of 
the cell. Thus, it is hypothesized that one of the first steps in 
tumor metastasis is a reorganization of the cell shape, such 
that it tonus a distinct protrusion at one edge facing the 
direction of migration. The cell then migrates through a blood 
vessel w all and travels to distal locations, eventually reattach- 
ing and forming a metastatic tumor. For example, human 
prostatic epithelial cells constitutively secrete prostate-spe- 
cilic antigen (PSAS, a kallikrein-like serine protease, which is 
a normal component of the seminal plasma The protease acts 
to degrade the extracellular matrix and facilitate invasion of 
cancerous cells. 

Synthetic and natural protease inhibitors hav e been shown 
to inhibit tumor promotion in vivo and in vitro. Previous 
investigations have indicated that certain protease inhibitors 
belonging to a family of structurally-related proteins classi- 
fied as serine protease inhibitors or SHRP1NS. are known to 
inhibit several proteases including trypsin, cathepsin G. 
thrombin and tissue kallikrein. as « ell as neutrophil elastase 
The SERPINS are extremely effective at preventing/sup- 
pressing carcinogen-induced transformation in vitro and car- 
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cinogenesis in animal mode! ,\ stem i Sj stemic delivery of 
purified protease inhibitors apparently reduces joint inflam- 
mation and cartilage and bone destruction as well. 

Topical administration of protease inhibitors finds use in 
5 such conditions as atopic dermatitis, a common form of 
inflammation of the skin, which may be localized to a few- 
patches or involve large portions of the body. Hie depigment- 
ing activity of protease inhibitors and their capability to pre- 
vent ultraviolet-induced pigmentation have been demon- 
ic stated both in vitro and in vivo(Seee.g„ Paine et af. .1. Invest. 
Dermatol.. 1 16:587-5^5 |2(K)lj) Protease inhibitors have 
also been reported to facilitate wound healing, i or example, 
secretory leukocyte protease inhibitor was demonstrated to 
reverse the tissue destruction and speed the wound healing 
is process when topically applied. In addition, serine protease 
inhibitors can also help to reduce pain in lupus erythematosus 
patients (See e.g., U.S. Pat. No. 6,537,968). 

As noted above, protease inhibitors interfere with the 
action of proteases Naturally occurring protease inhibitors 
.> can be found in a \ ariety of foods such as cereal grains (oats, 
barley, and maize). Brussels sprouts, onion, beetroot, wheat, 
finger millet, and peanuts. One source of interest is the soy- 
bean. The average level of protease inhibitors present in soy- 
beans is around 1 .4 percent and 0.6 percent for Kunitz and 
25 Bowman-Birk respectively, two of the most important pro- 
tease inhibitors. Notably, these low levels make it impractical 
to isolate the natural protease inhibitor for clinical applica- 

Thus, there is a need for methods and compositions suit- 
• ' able for the large-scale production of protease inhibitors and 
their variants. In particular, there remains a need for compo- 
sitions and methods that reduce and/or eliminate risks asso- 
ciated with blood-borne infectious agents when these pro- 
teases are produced in mammalian tissue culture cells. 

35 

SUMMARY OF THE INVENTION 

The present invention provides compositions and methods 
related to expression of protease inhibitors and variants 
in thereof in bacterial species, file present invention further 
provides fusion nucleic acids, vectors, fusion polypeptides, 
and processes for obtaining the protease inhibitors. 

Provided herein are nucleic acids, cells and methods for the 
production of protease inhibitors and variants thereof 
45 The present invention provides nucleic acids encoding at 
least one functional protease inhibitor. In one aspect, a 
nucleic acid composing regulatory sequences operatively 
linked to a first, second, third and fourth nucleic acid 
sequences are provided. In some embodiments, terminator 
si, sequences are provided following the fourth nucleic acid 
sequence. 

In alternative embodiments, the first nucleic ac id sequence 
encodes a signal polypeptide functional as a secretory 
sequence in a first host organism, the second nucleic acid 

-s encodes a secreted polypeptide or functional portion thereof 
normally secrete.:! from the first or a second host organism, the 
third nucleic acid encodes a eleavable linker and the fourth 
nucleic acid encodes a protease inhibitor or fragment thereof . 
In further embodiments, the present invention provides at 

60 least one expression cassette comprising nucleic acid 
sequences encoding at least one protease inhibitor 

In additional embodiments, the present invention provides 
polynucleotides encoding at least one protease inhibitor van- 
t t In i vu Id refe ml iment lit I 

<-o line ieofiile encodes a Bowman-Birk Inhibitor ( BB1) variant, 
wherein at least one loop of the wild-type BBI has been 
altered. 
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The present invention also prov ides methods of expressing 
functional protease inhibitors or variants thereof. In some 
preferred embodiments, host ceils suitable lor production of 
functional pn itease inhibitors and/or variants thereof are pro- 
vided. In some embodiments, a host cell is (i) trdnsformed 5 
with at least one expression cassette comprising a nucleic acid 
sequence encoding at least one protease inhibitor or variant 
thereof, and lii) cultured under appropriate conditions to 
express at least one protease inhibitor or variants thereof. In 
sonic embodiments ol the methods, the method further com- v< 
prises recovering the protease inhibitor or variant thereof. 

In alternative embodiments, a host cell is (i) transformed 
with a first expression cassette comprising at least one nucleic 
acid sequence encoding a protease inhibitor or variant 
thereof, 'in transformed with a second expression cassette 15 
comprising a nucleic acid sequence encoding at least one 
thiol -disulfide oxidorednctase or chaperone. and 'in i cultured 
under appropriate conditions to express the protease inhibi- 
tors or variant thereof. In some embodiments, the protease 
inhibitors orvariants thereof are recovered. In some preferred 20 
embodiments, the protease inhibitors or variant thereof are 
expressed as a fusion protein. In further embodiments, the 
methods further comprise recovering the protease inhibitor or 

In still further embodiments, functional protease inhibitors 25 
and variants thereof are provided. In some embodiments, the 
functional protease inhibitor or variant thereof is expressed as 
a fusion protein consisting of the signal sequence, a eellulase 
catalytic domain, a cleavable linker region, and then by the 
mature protease inhibitor or variant thereof. 30 

In additional embodiments, the expressed proteins are 
treated with a protease and/or acid/heat treatment to liberate a 
protease inhibitor or variant thereof from the fusion protein. 

In farther embodiments, the present invention provides a 
polypeptide ha\ ing protease inhibitory activity, selected from 35 
the group consisting of 

a) Bowman-Birk Inhibitor variants; 

b) Bowman-Birk Inhibitor; and 

c) A scaffold comprising at least one variant sequence. 

The present invention provides compositions comprising a v 
fusion protein, wherein the fusion protein comprises a pro- 
tease inhibitor, and a peptide of interest. In some embodi- 
ments, the fusion protein comprises an amino acid selected 
from the group consisting of SHQ ID NOS:2. 4, 6 and 8. In 
alternative embodiments, the fusion protein is encoded by a 4S 
nucleotide sequence selected From the group consisting of 
SEQ ID NOS:l, 3. 5. and 7. In some preferred embodiments, 
the protease inhibitor is selected from the group consisting of 
Bowman-Birk inhibitor (BBI), soybean trypsin inhibitor 
(S IT ). and eglin < In some particularly prelerred embodi- v> 
merits, the protease inhibitor is BBI. and « herein [lie BBI 
comprises at least one loop selected from the group consisting 
oi'a trypsin loop and a chymotrypsin loop. In further prelerred 
embodiments, the protease inhibitor is a sea hold lor the pep- 
tide of interest. In some preferred embodiments, the loop 55 
comprises at least one peptide of interest. In some particularly 
preferred embodiments, the peptide of interest comprises the 
ammo acid sequence set forth in SI :Q I [) NOO In some more 
preferred embodiments, the fusion protein comprises the 
amino acid sequence set forth in SEQ ID NO:4. 60 

fhe present invention also pro\ ides methods lor producing 
at least one protease inhibit' nn if ictenal cell comprising: a) 
introducing a DNA construct into a bacterial cell, wherein the 
DNA construct comprises a heterologous DNA sequence 
encoding a protease inhibitor derived from a Bowman-Birk a 
nn ibitoi I11 nl the reo I cull iriny the 1 t 1 1 
cell under suitable culture conditions to allow expression of 
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the heterologous DNA sequence; and c) producing the pro- 
tease inhibitor. In some embodiments, the construct further 
comprises at least one catalytic domain. In some embodi- 
ments, catalytic domain is selected from the group consisting 
of eellulase. cutinase. and disulfide isomerase. In some pre- 
ferred embodiments, the method further comprises the step of 
recovering the protease inhibitor. In some particularly pre- 
ferred embodiments, the method further comprises the step of 
activating the protease inhibitor. In some most preferred 
embodiments, the activating is accomplished by exposing the 
protease inhibitor to at least one reagent selected from the 
group consisting of compositions that reduce disulfide bonds, 
compositions that oxidize disulfide bonds, and compositions 
that alter the redox potential. In some embodiments, the bac- 
terial cell is a member of the genus Bacillus. In additional 
embodiments, protease inhibitor is selected from the group 
consisting of: i) a protease inhibitor having at least 90% 
sequence identity with SEQ ID NO: 1 1 ; and ii) a protease 
inhibitor having at least 90", > sequence identity with SEQ ID 
NO: 13. In yet further embodiments, the protease inhibitor 
comprises a variant sequence. In additional embodiments, the 
protease inhibitor is Bowman-Birk inhibitor and wherein 
Sl-Q ID N( ):') is substituted lor at least one loop, wherein the 
loop is selected from the group consisting of the tr> psin loop 
and the chymotrypsin loop. In some embodiments, the 
method further comprises the step of introducing a second 
nucleic acid sequence encoding a thiol-disu Hide oxidorednc- 
tase or chaperone into the bacterial cell. In some preferred 
embodiments, the protease inhibitor is expressed as a fusion 
protein. In some preferred embodiments, the fusion protein 
further comprises a eellulase catalytic domain, a cleavage 
site, and the protease inhibitor. In some particularly preferred 
embodiments, the fusion protein is processed by a protease or 
acid/heat treatment to liberate the protease inhibitor. In still 
further preferred embodiments, the fusion protein further 
comprises at least one linker sequence. In some embodi- 
ments, the linker sequence is selected from the group consist- 
ing of SEQ ID NOS: 14 1-1 43. 

The present invention also provides protease inhibitor 
compositions comprising the protease inhibitor produced 
according to any and all of the methods described herein. In 
some embodiments, the various primers and oligonucleotides 
described herein find use in the production of the fusion 
proteins of the present invention. In addition, the present 
invention pro\ ides numerous peptides suitable for use in the 
present invention. 

The present invention also pro\ ides methods for inhibiting 
the proteolytic ach\ ity of a target protein comprising contact- 
ing the target protein with any of the protease inhibitor com- 
positions described herein, and binding the target protein 
wherein the proteolytic activity of the target protein is inhib- 
ited. 

The present invention further provides isolated polynucle- 
otides encoding a protease inhibitor selected from the group 
consisting of polypeptide sequences set forth in SEQ ID 
NOS: 10 and 12. 

The present invention also provides expression vectors 
comprising a polynucleotide sequence, wherein the poly- 
nuclei 1 uenci 1 elected I im tin l! in 1 pi 1 

nucleotide sequence encoding a protease inhibitor having at 
least 90% sequence identity with SEQ ID NO: 1 1. wherein at 
least one of the loop residues have been replaced with a 
variant sequence: and b . a polynucleotide sequence encoding 
a protease inhibitor having at least 90'!.', sequence identity 
with SEQ ID NO: I 3. wherein at least one of the loop residues 
have been replaced with a variant sequence. In some embodi- 
ments, the expression vectors further comprise from the 5' 



US 7,413,877 B2 



j the 3' terminus: a first nucleic acid sequence 
encoding a signal peptide functional as a secretory sequence 
in a bacterial cell: a second nucleic acid sequence encoding a 
secreted polypetide or functional portion thereof, a third 
nucleic acid sequence encoding a cleavablc linker, and the < 
DNA sequence which encodes the protease inhibitor. The 
present invention also provides host ceils transformed with 
the expression vectors of the present invention. In some pre- 
ferred embodiments, the host cell is a Ha, ■:I!its species cell. 

Hie present in\ ention also prov ides compositions compris- :<> 
ing at least one scaffold protein and at least one peptide, 
wherein the scaffold comprises Bowman-Birk inhibitor. In 
some preferred embodiments, the peptide comprises a pep- 
tide that binds to VegF. 

Other objects, features and advantages of the present 15 
invention are apparent from the following detailed descrip- 
tion. It should be understood, however, that the detailed 
description and specific examples. v\ hi lo indicating preferred 
embodiments of the invention, are given by way of illustration 
only, since various changes and modifications within the 20 
scope and spirit of the invention will be apparent to those 
skilled in the art from this detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 provides the DNA and amino acid sequences of the 
aprE-BCE103-BBl-Histag expression cassette (BcoRl-Hin- 
dlll) cloned into the pJM103 integration vector (SEQ ID 
NOS:l and 2). 

FIG. 2 provides a schematic map of the pJM103BBIhis 30 
expression vector. 

FIG. 3 provides the DNA and amino acid sequences of 
12BBIck81 from the BCE103 fusion site (at the BamHI) lo 
the end of the gene (SEQ ID NOS:3 and 4). The CK3728I 
peptide sequences (ACYNEYGWTC (SEQ ID NO:9) arc 35 
inserted into both the trypsin and chymotrypsin inhibitory 

FIG. 4 provides a graph showing titers of active versus 
inactive 2BB]ck8 1 (by trypsin inhibition ) and the ratio of the 
activities of B( T 103 cellulase to 2BBck81 with various thiol 40 
reducing agents added during the growth of the culture. In this 
figure. BME 2-mercaptoethnnol. Cyt cysteine. 
( lint reduced glutathione, DTT dithiothreitol). 

FIG. 5 prov id.es a graph showing activation of B( T-lnk2- 
2BBIck81 with 2-mercaptoethanol (bME) after partial puri- 45 
float ion by ion exchange chromatography. 

FIG. 6 provides a ph how in i ults from a competition 
analysis of 2BBlck81 versus anti-Veid' antibodv bindinu to 
VegF. 

I'iG. 7 prov ides the sequence of the synthetic DNA frag- , , 
mem carrying the // insoL-ns 1'Dl (hiPDI) that was inserted 
into the/?, suhtilis- BB1 expression vector, as well as the amino 
acid sequence ( SEQ ID NOS:5 and 6) 

FIG. 8 prov ides the DNA and annuo acid sequences of the 
nprE-cutinase expression cassette that was filiated into the 55 
EcoRI-BamHl sites of p2JM103-lnk2-2BBIck81 (SEQ ID 
NOS:7and 8). 

DESCRIPTION OF THE INVENTION 

The present inv ention prov ides compositions and methods 
related to expression of protease inhibitors and variants 
thereof in bacterial species. The present invention further 
provides fusion nucleic acids, vectors, fusion polypeptides, 
a t 1 r sses 1 lini the protea inhibitors 
""'~.s otherwise indicated, the practice of the present 
„ :™.«lves conventional techniques commonly used 



in molecular biology, microbiology, and recombinant DNA. 
which are within the skill of the art. Such techniques are 
known to those of skill in the art and are described in numer- 
ous texts and reference works (See e.g., Sambrook et af, 
"Molecular Cloning: A I aboralory Manual". Second Edition 
(( 'old Spring 1 larbori. | l l )80[); and Ausubel et ah. "Current 
Protocols in Molecular Biology" | P>87|). All patents, patent 
applications, articles and publications mentioned herein, both 
supra and infra, are hereby expressly incorporated herein by 
reference. 

Unless defined otherwise herein, all technical and scien- 
tific terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which this 
invention pertains. For example. Singleton and Sainsbury, 
Dictionary of Microbiology ,,.,,/ Molecular Biology, 2d Ed., 
John Wiley and Sons. NY ( 1 994); and Hale and Marham. The 
Harper Collins l)h lioiiarvot Wology. Harper Perennial. N. Y. 
(1991) provide those of skill 111 the art with a general dictio- 
naries of many of the terms used in the invention. Although 
any methods and materials similar or equivalent to those 
described herein find use in the practice of the present inven- 
tion, the preferred methods and materials are described 
herein. Accordingly, the terms defined immediately below arc- 
more fully described by reference to the Specification as a 
whole. Also, as used herein, the singular "a", "an" and "the" 
includes the plural reference unless the context clearly indi- 
cates otherwise. Numeric ranges are inclusive of the numbers 
defining the range. Unless otherw ise indicated, nucleic acids 
are written left to right in 5' to 3' orientation; amino acid 
sequences are written left to right in amino to carboxy orien- 
tation, respectively. It is to be understi lod that this invention is 
not limited to the particular methodology, protocols, and 
reagents described, as these may vary, depending upon the 
context they are used by those of skill in the art. 

Furthermore, the headings provided herein are not limita- 
tions of the various aspects or embodiments of the invention 
which can be had by reference to the specification as a whole. 
Accordingly, the terms defined immediately below are more 
fully defined by reference to the specification as a whole. 
Nonetheless, in order to facilitate understanding of the inven- 
tion, a number of terms are defined below. 

Definitions 

As used herein, the terms "expression cassette" and 
"expression vector" refer to nucleic acid constructs generated 
recombinants or synthetically, with a series of specified 
nucleic acid elements thai permit transcription of a particular 
nucleic acid in a target cell, 'file recombinant expression 
cassette can be incorporated into a plasniid. chromosome, 
mitochondrial DNA. plastid I )NA, virus, or nucleic acid frag- 
ment. Typically, the recombinant expression cassette portion 
of an expression vector includes, among other sequences, a 
nucleic acid sequence to be transcribed and a promoter. The 
term "expression cassette" may be used interchangeably 
herein with "DNA construct" and its grammatical equiva- 

As used herein, the terms "vector" and "cloning vector" 
refer to nucleic acid constructs designed to transfer nucleic 
acid sequences into cells. 

As used herein, the term "expression vector" refers to a 
vector that has the ability to incorporate and express heter- 
ologous DNA fragments in a foreign cell. Many prokaryotic 
and eukaryotic expression vectors are commercially avail- 
able. Select ion o ('appropriate expression vectors is w ithin the 
vl Ige ol tin .v of sktl in ih_ a 1 

As used herein, the term "plasrnid" refers to a circular 
double-stranded nisi I >NA construct use.:! as a cloning vector. 
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and which forms an extrachromosomal self-replicating 
genetic clement in -nine eukaryotcs or prokaryotes, or inte- 
grates into the host chromosome. 

As used herein, the terms "nucleic acid molecule" and 
"nucleic acid sequence" include sequences of any form of 5 
nucleic acid, including, but not limited to RNA, DNA and 
cDNA molecules. It will be understood that, as a result of the 
degeneracy of the genetic code, a multitude of nucleotide 
sequences encoding a given protein may be produced. 

As used herein, a "fusion 1 )NA sequence" comprises from 
5' to y a lirst. second, third and fourth 1 )NA sequences. 

As used herein, "a first nucleic acid sequence" or "first 
DNA sequence" encodes a signal peptide functional as a 
secretory sequence in bacterial species. Particularly preferred 
signal sequences are those derived from polypeptides is 
secreted by the express ion host used to exp res s ; i nd secrete the 
fusion polypeptide. As used herein, first amino acid 
sequences correspond to secretory sequences w Inch are func- 
tional in a bacterial species. Such amino acid sequences are 
encoded by first DNA sequences as defined. 20 

As used herein, "second DNA sequences" encode 
"secreted polypeptides" (i.e., "secreted polypeptides of inter- 
est") expressed by the chosen bacterial species. As with the 
first DNA sequences, preferred secreted polypeptides are 
those w hich are naturally secreted by the bacterial expression 25 
host. However, in some embodiments, the term refers to het- 
erologous protein (i.e.. proteins that are not normally secreted 
by the particular bacterial host). For example, in some pre- 
ferred embodiments, a cellulase expressed by a Bacillus spe- 
cies other than B. subtilis is used that the secreted polypeptide 30 
of interest and B. subtilis is used as the expression host. 

As used herein, "functional portion of a secreted polypep- 
tide" and its grammatical equivalents refers to a truncated 
secreted polypeptide that retains its ability to fold into a 
normal, albeit truncated, configuration. In some embodi- 35 
ments, it is contemplated that sufficient residues of a domain 
of the secreted polypeptide must be present to allow it to fold 
in its normal configuration independently of the desired 
polypeptide to which it is attached. However, in most cases, 
the portion of the secreted polypeptide are both correctly 4C 
folded and result in increased secretion as compared to its 
absence. 

Similarly, in most cases, the truncation of the secreted 
polypeptide means that the functional portion retains a bio- 
logical function. In a preferred embodiment, the catalytic 45 
domain of a secreted polypeptide is used, although other 
functional domains may be used, for example, the substrate 
binding domains. Additionally preferred embodiments utilize 
the catalytic domain and all or part of the linker region. 

As used herein, "third DNA sequences" comprise DNA 5 
sequences encoding a eleavable linker polypeptide. It should 
be understood that the third I )NA sequence need only encode 
that amino acid sequence w Inch is necessary to be recognized 
by a particular enzyme or chemical agent to bring about 
cleavage of the fusion polypeptide. Thus, only that portion of 5 
the linker which is necessary for recognition and cleavage by 
the appropriate enzyme is required. 

As used herein, "fourth DN A sequences" encode "desired 
polypeptides." Such desired p dypeptides include protease 
inhibitors either in their mature or pro forms, and variants 6 
thereof. 

lite above -defined four DNA sequences encoding the cor- 
responding four amino acid sequence', are combined to form 
a "fusion DNA sequence." Such fusion DNA sequences are 
assembled in proper reading frame from the 5* terminus to 3' 6 
terminus in the order of first, second, third and fourth DNA 
sequences. As so assembled, the DNA. sequence encodes a 
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"fusion polypeptide," "fusion protein." and "fusion analog" 
encoding from its amino-terminus a signal peptide functional 
as a secretory sequence in a bacterial species, a secreted 
polypeptide or portion thereof normally secreted by a bacte- 
rial species, a eleavable linker polypeptide and a desired 
polypeptide. 

As used herein, the terms "desired protein" and "desired 
polypeptide" refer to a poly peptide or protein in its mature or 
pro form that is not fused to a secretion enhancing construct. 
Thus, a "desired protein" and "desired polypeptide" refer to 
the protein to be expressed and secreted by the host cell in a 
non-fused form. 

As used herein, "fusion polypeptides." "fusion proteins." 
and "fusion analogs" encode from the amino-terminus a sig- 
nal peptide functional as a secretory sequence functional in a 
host cell, a secreted polypeptide or portion thereof normally 
secreted from a host cell, a eleavable linker polypeptide and a 
desired polypeptide. In some embodiments, the fusion pro- 
tein is processed by host cell enzymes (e.g.. a protease), to 
yield the desired protein free from the other protein sequences 
in the fusion protein. As used herein, the terms "fusion ana- 
log," "fusion polypeptide," and "fusion protein" are used 
interchangeably. 

As used herein, a "promoter sequence" refers to a DNA 
sequence which is recognized by the bacterial host for expres- 
sion purposes. In preferred embodiments, it is operably 
linked to a DNA sequence encoding the fusion polypeptide. 
Such linkage comprises positioning of the promoter with 
respect to the translation initiation codon of the DNA 
sequence encoding the fusion I )NA sequence. In particularly 
preferred embodiments, the promoter sequence contains tran- 
scription and translation control sequences which mediate the 
expression of the fusion DNA sequence. 

As used herein, "terminator sequence" refers to a DNA 
sequence which is recognized by the expression host to ter- 
minate transcription. It is operably linked to the 3' end of the 
fusion DNA encoding the fusion polypeptide to be expressed. 

As used herein, the term "selectable marker-encoding 
nucleotide sequence" refers to a nucleotide sequence which is 
capable of expression in bacterial cells and where expression 
of the selectable marker confers to cells containing the 
expressed gene the ability to grow in the presence of a corre- 
sponding selective condition. 

As used herein, a nucleic acid is -'operably linked" when it 
is placed into a functional relationship with another nucleic 
acid sequence. For example. DNA encoding a secretory 
leader is operably linked to DNA for a polypeptide if it is 
expressed as a preprotein that participates in the secretion of 
the polypeptide, a pro mi ter or enhancer is operably linked to 
a coding sequence if it affects the transcription of the 
sequence: or a ribosome binding site is operably linked to a 
coding sequence if it is positioned so as to facilitate transla- 
tion. Operably linked DN Vsa u reu sually contiguous, 
and. in the ease of a secretory leader, contiguous and in 
reading phase. However, enhancers do in it have to be contigu- 
ous. Linking is accomplished by ligation at convenient 
restriction sites. If such sites do not exist, the synthetic oligo- 
nucleotide adaptor-, or linkers are used in accordance with 
conventional practice. 

As used herein, "recombinant" includes reference to a cell 
or vector, that has been modified by the introduction of a 
heterologous nucleic acid sequence or that the cell is derived 
from a cell so modified. Thus, for example, recombinant cells 
express genes that are not lUmd in identical form within the 
native (non-recombinaut) form of the cell or express native 
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genes that are otherwise abnormal!) expressed, under 
expressed or not expressed at all as a result of deliberate 
human intervention. 

As used herein, the term "expression" refers to the proeess 
by which a polypeptide is produced based on the nucleic acid 5 
sequence ol'a gene. She proeess includes both transcription 
and translation. Thus, the term "paitease inhibitor expres- 
sion" refers to transcription and translation of the specific 
protease inhibitors and variants thereof gene to be expressed, 
the products of which include precursor KNA. mRNA. 
polypeptide, post-translation processed polypeptide, and 
derivatives fhereo ! Similarly, "pn i tease inhibitor expression" 
refers to the transcription, translation and assembly of pro- 
tease inhibitors and variants thereof exemplified by the 
expression cassette shown in FIG 1 By way of example, 15 
assays for protease inhibitor expression include examination 
of bacterial colonies when exposed to the appropriate condi- 
tions. Western blots for protease inhibitor protein, as well as 
Northern blots anaK sis and reverse transcriptase polymerase 
chain reaction (RT-PCR) assays for protease inhibitor in 
mRNA. 

As used herein, the terra-, "isolated" and "purified" as used 
herein refer to a nucleic acid or polypeptide that is removed 
from at least one component with which it is naturally asso- 
ciated. 25 

As used herein, the term "substantially free" encompasses 
preparations of the desired polypeptide having less thanabout 
20% (by dry weight | othei proteins (i.e., contaminating pro- 
tein), less than about 10% other proteins, less than about 5% 
other proteins, or less than about 1 % other proteins. 30 

As used herein, the term "substantially pure" when applied 
to the proteins or fragments thereof of the present invention 
means that the proteins are essentially free of other sub- 
stances to an extent practical and appropriate for their 
intended use. In particular, the proteins are sufficiently pure 35 
and are sufficiently live from other biological constituents of 
the host cells so as to be useful in. for example, protein 
sequencing, and/or producing pharmaceutical preparations. 

As used herein, the term "target protein" refers to protein 
(e,g..en/yme. hormone, etc), whose action would be blocked t 
by the binding of the variant inhibitors provided lor herein. 

As used herein, the terms "variant sequence" and "variant 
sequences" refer to the short polypeptide sequence! s) that 
replace the binding loops of the wild-type protease inhibitor 
or other scaffold. The variant sequence does not need to be of o 
the same length as the binding loop sequence it is replacing m 
the scaffold. 

As used herein, the term "scaffold" refers to a wild-type 
protein sequence into w hich a variant sequence is introduced. 
In some embodiments, the scaffold has portions (e.g.. loops). < 
that are replaced. For example, the BB1 sequences provided 
herein find use as scaffolds for vai ' 



Protease Inhibitors 

Two protein protease inhibitors have been isolated from 
soybeans, the Kunitz-t\ pe try psin inhibitor (soybean trypsin 
inhibitor. STI) and the Bowman-Birk protease inhibitor 
(BBI) (See e.g., Birk, Int. J. Pept. Protein Res., 25:1 13-131 
[1985|; and Kennedy, Am. J. Clin. Neutr. 68T406S-1412S 
1 1998]). These inhibitors serve as a scaffold for the variant 
sequences. In addition to alterations in the scaffold compris- 
ing the variant sequences, other desired proteins used herein 
include the addition of six histidine residues at theC-terminus 
(See. FIGS. 1 and 2). 

Soybean Trypsin Inhibitor (STI) 

STI inhibits the proteolytic activity of trypsin by the for- 
mation of a stable stoichiometric complex (See e.g., Liu, 



Chemistry and Nutritional Value of Soybean Components. In: 
Soybeans, Chemistry Technology and Utilization, pp. 32-35, 
Aspen Publishers, Inc.. Gaithersburg. Mel.. ) 1 099]). STI con- 
sists of 181 amino acid residues with two disulfide bridges 
and is roughly spherically shaped (See e.g.. Song et af. J. 
Mol. Biol.. 275:347-63 |1998|). fhe trypsin inhibitory loop 
lies within the first di ulfidebrid e i nt/-tvpe soybean 
trypsin inhibitor (STI ) has played a key role in the early study 
of proteinases, having been used as the main substrate in the 
biochemical and kinetic w ork that led to the definition of the 
standard mechanism of action of proteinase inhibitors. 
Bowman-Birk Inhibitor (BBI) 

Bowman-Birk inhibitor proteins arc a kinctically and 
structurally well-eharacteri/ed family ol small proteins (60- 
90 residues) isolated from leguminous seeds, as well as other 
plants, including various grasses I'hey typically have a sym- 
metrical structure of two tricyclic domains each containing an 
independent binding loon, although some hav e one domain 
and some have more than two domains. The major -8 kl)a 
Bowman-Birk inhibitor isolated from soybeans (BBI) has 
two separate reactive site loops, loop I inhibits proteases 
having tr\psin-like specificity and loop II inhibits proteases 
with chymotrypsin-like specificity (See e.g.. Chen et al., J. 
Biol. Chem., 267:1990-1994 (1992|; Werner and Wemmer, 
Biochem., 31:999-1010 [1992]; Lin et al., Eur. J. Biochem., 
212:549-555 [1993]: Voss etal., Eur. J. Biochem., 242:122- 
131 [ 1 996]: and Hillings et al.. Pro. Natl. Acad. Sci., 89:3120- 
3 124 ( l'/92| i. I hose binding regions each contain a "canoni- 
cal loop" structure, which is a motif found in a variety of 
serine proteinase inhibitors (Bode and Huber, Eur. J. Bio- 
chem., 204:433-451 [1992]). STI and BBI are found only in 
the soybean seed, and not in any other part of the plant (See 
e.g.. Birk, Int. J. Pept. Protein Res., 25:1 13-131 [1985]). 

Although numerous iso forms of BBI have been character- 
ized, SEQ ID NO: 13 shows the amino acid sequence of the 
BBI backbone used herein comprising approximately 71 
amino acid residues (See Example 1). 

hi soybeans. BBI is produced as a pro-protein with an 
N-terminal pro-peptide that is 1 9 amino acids in length. Thus, 
in some embodiments. BB] is produced with all or at least a 
portion of the propeptide. In some embodiments, BBI is trun- 
cated, with as many as 10 amino acid residues being removed 
from cither the N- or C -terminal. For example, upon seed 
desiccation, some BBI molecules have the < '-terminal 9 or 10 
annuo acid residues removed Thus, proteolysis is generally 
highly tolerated prior to the initial disulfide and just after the 
terminal disulfide bond, the consequences of which are usu- 
ally not detrimental to the binding to target protein, I lowever. 
it will be appreciated that any one oft he iso forms or truncated 
forms find use in various embodiments of the present inven- 

Protease Inhibitor Variants 

As indicated above, the STI and BBI protease inhibitors 
'■ have bind ■ _ li ps lb t inhibit protcasi I 'hi pn si nt inv n 
tion provides protease inhibitor variants with alterations in 
one ormore reactive sites (e.g., Loop I and/or Loop II of BBI). 
In some preferred embodiments, the loi ips .ire replaced with 
sequences that interact with a target protein. 

For example, in some embodiments, the loops are replaced 
with sequences derived fromVKGF binding proteins, inhibi- 
tors of the complement pathway such as C2, C3. C4 or C5 
inhibitors, ("oinpstatin. cytokines, other proteins of interest, 
etc. Indeed, it is not intended that the present invention be 
limited to any particular sequence substituted into either of 
these loops, as any suitable sequence finds use in the present 
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In some embodiments, variant sequences are selected by 
various methods known in the art. including but not limited to 
phage display and other suitable screening methods. For 
example, a random peptide gene library is fused with phage 
Pill gene so the peptide library- will be displayed on the 
surface of the phage. Subsequently, the phage display library 
is exposed to the target protein and washed with buffer to 
remove non-specific binding (this process is sometimes 
referred to as panning), finally, the binding phage and PCR 
the DNA sequence for the peptide encoded are isolated. 

In most embodiments, one of the loops is replaced with a 
variant sequence (i.e.. peptides: often 3 to 14 amino acids in 
length, with 5 to 10 amino acids being preferred). Longer 
sequences find use in the present invention, as long as they 
provide the binding and/or inhibition desired. In addition, 
peptides suitable lor use is replacements of the binding 
loop(s) preferably adopt a functional conformation when 
contained within a constrained loop (i.e.. a loop funned by the 
presence of a disulfide bond between two cysteine residues). 
In some specific embodiments, the peptides are between 7 
and 9 ammo acids in length, these replacement sequences 
also provide protease inhibition or binding to the targeted 
proteins. In some embodiments, it is advantages to alter a 
single amino acid. 
Fusion Proteins 

In preferred embodiments, each protease inhibitor or vari- 
ant thereof is expressed as a fusion protein by the host bacte- 
rial cell. Although cleavage of the fusion polypeptide to 
release the desired protein will often be useful, it is not nec- 
essary. Protease inhibitors and variants thereof expressed and 
secreted as fusion proteins surprisingly retain their function. 

The above-defined DNA sequences encoding the corre- 
sponding amino acid sequences are combined to form a 
"fusion DNA sequence." Such fusion DNA sequences are 
assembled in proper reading frame from the 5' terminus to 3' 
terminus in the order of first, second, third and fourth DNA 
sequences. As so assembled, the DNA sequence encodes a 
"fusion polypeptide" encoding from its amino-terminus a 

rial species, a secreted polypeptide or portion thereof nor- 
mally secreted from a bacterial species, a cleavable linker 
peptide and a desired polypeptide (e.g.. a protease inhibitor 
and variants thereof). Various methods are known to tln.se m 
the art for the production of fusion proteins (See e.g., U.S. Pat. 
Nos. 5.411.873. 5.429.950. and 5.679.543. all of which are 
incorporated by reference herein), [bus. it is intended that 
any suitable method will find use in the present invention. 

Expression of Recombinant Protease Inhibitors 

To the extent that the present invention depends on the 
production of fusion proteins, it relies on routine techniques 
in the field of recombinant genetics. Basic texts disclosing the 
general methods of use in this invention include Sambrook et 
af. Moht-ular Cloning,, A laboratory Manual ((2nd ed.) 
| 1984 1 ); Kriegler. ( lene I'rai • ■ision II abi ra- 

tory Manual ,1990): and Ausubel et al.. (eds.), Current Pro- 
tocols in Molecular Biology ( 1 994). 

The present invention provides bacterial host cells which 
have been transduced, transformed or transfected with an 
expression vector comprising a protease inhibitor-encoding 
nucleic acid sequence. The culture conditions, such as tem- 
perature, pH and the like, are those previously used for the 
parental host cell prior to transduction, transformation or 
translection are apparent to those skilled in the art. 

Basically, a nucleotide sequence encoding a fusion protein 
is operably linked to a promoter sequence functional in the 
host cell. This promoter-gene unit is then typically cloned into 
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intermediate vectors before transformation into the host cells 
for replication and/ or expression. These intermediate vectors 
are typically prokaryotic vectors (e.g., plasmids, or shuttle 
vectors). However, it is not intended that the present iu\ cm ion 

5 be limited to the use of intermediate vectors, as this step is 
omitted in some preferred embodiments. 

In one approach, a bacterial culture is transformed with an 
expression sector having a promoter or biologically active 
promoter fragment or one or more (e.g.. a scries) of enhancers 

a> which functions in the host cell, operably linked to a nucleic 
acid sequence encoding a protease inhibitor, such that the a 
protease is expressed in the cell. In some preferred embodi- 
ments, the DNA sequences encode a protease inhibitor or 
variant thereof. In another preferred embodiment, the pro- 

15 moter is a regulatable one. 

Nucleic Acid t onstnicts Expression Vectors. 

Natural or synthetic polynucleotide fragments encoding a 
protease inhibitor (i.e., "Pi-encoding nucleic acid 

, f . sequences") may be incorporated into heterologous nucleic 
acid constructs or \ ectors. capable of introduction into, and 
replication in. a bacterial cell, fhc vectors and methods dis- 
closed herein are suitable lor use in various host cells for the 
expression id protease inhibitors and variants thereof. Any 

, s vector may be used as long as it is replicable and v iable in the 
cells into which it is introduced. I arge numbers of suitable 
vectors and promoters are known to those of skill in the art, 
and are commercially available. Appropriate cloning and 
expression vectors are also described in various references 

M , known to those in the art (See e.g., Sambrook et al., supra and 
Ausubel et al., supra, expressly incorporated by reference 
herein). The appropriate DNA sequence is inserted into a 
plasmid or vector (collectively relerred to herein as '"vec- 
tors") by any suitable method. In general, the I )NA sequence 

35 is inserted into an appropriate restriction endonuclease site(s) 
by standard procedures known to those in the art. 

Appropriate vectors are typically equipped with a select- 
able marker-encoding nucleic acid sequence, insertion sites, 
and suitable control elements, such as termination sequences. 

4(i In some embodiments, the vectors comprise regulatory 
sequences, including, for example, control elements (i e . 
promoter and terminator elements or 5' and or V untranslated 
regions), effective lor expression of the coding sequence in 
host cells (and or in a vector or host cell environment in which 

: , a modified soluble protein coding sequence is not normally 
expressed), operably linked to the coding sequence. I arge 
numbers of suitable vectors and promoters are know n to those 
of skill in the art. many of which are commercially available 
and known to those in the art. 

50 Exemplary promoters include both constitutive promoters 
and inducible promoters. Such promoters are well know n to 
those of skill in the art. I hose skilled in the art are also aware 
that a natural promoter can be modified by replacement, 
substitution, addition or elimination of one or more nucle- 
us otiues without changing its function. I he practice of the 
present invention encompasses and is not constrained by such 
alterations to the promoter. The choice of promoter used in the 
genetic construct is within the knowledge of one skilled in the 

fhc choi e of the proper selectab uu d i 
th I ell . Appropriate l V for different I lenal hos 
are well known in the art. Typical selectable marker genes 
encode proteins that (a) confer resistance to antibiotics or 
othertoxms (e.g.. ampictllm. methotrexate, tetracycline, neo- 
- mycin my et .phenolic acid, puromycin. /eomycin, or hygro- 
mycin; arlb) complement an auxotrophic mutation or a natu- 
rally occurring nutritional deficiency in the host strain. 
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In some embodiments, a selected PI coding sequence is 
inserted into a suitable vector according to well-known 
recombinant techniques and used to transform a cell line 
capable of PI expression. Due to ihe inherent degeneracy of 
the genetic code. * itlier nucleic acid sequences which encode 5 
substantially Ihe same or a luiictionalh equiv alent amino aeid 
sequence may be used to clone and. express a specific protease 
inhibitor, as further detailed above. I herefore it is appreciated 
that such substitutions in the coding region fall within the 
sequence variants covered by the present invention. Any and 10 
all of these sequence valiants can be utilized in the same way 
as described herein for a parent PI -encoding nucleic acid 
sequence. Those skilled in the art recogni/e that ciillering Pis 
will be encoded b> differing nucleic acid sequences. 

In some embodiments, once the desired form of a protease 1 5 
inhibitor nucleic aeid sequence, homologue, variant or frag- 
ment thereof, is obtained, it is modified by any number of 
ways. Where the sequence involves non-coding flanking 
regions, the Hanking regions may be subjected to resection, 
mutagenesis, etc. 1'luis. transitions, transversions, deletions, 20 
and insertions may be performed on the naturally occurring 
sequence. 

In some preferred embodiments, heterologous nucleic aeid 
constructs include the coding sequence for at least one pro- 
tease inhibitor, or variant! si. fragment! s) or splice variant(s) 25 
thereof: (i) in isolation'. Cii) in combination with additional 
coding sequences; such as fusion protein or signal peptide 
coding sequences, w here t he PI coding sequence is the domi- 
nant coding sequence; (iii) in combination with non-coding 
sequences, such as control elements, such as promoter and 30 
terminator elements or 5' and.or V untranslated regions 
effective for expression of the coding sequence in a suitable 
host; and/or (iv) in a vector or host environment in which the 
PI coding sequence is a heterologous gene. 

In some embodiments, heterologous nucleic acid contain- 35 
ing the appropriate nucleic acid coding sequence, together 
with appropriate promoter and control sequences, is 
employed to introduced into bacterial host cells to permit the 
cells to express at least one protease inhibitor or variant 
thereof. 40 

In some embodiments of the present invention, a heterolo- 
gous nucleic acid construct is employed to transfer a PI- 

I erred embod Pit 
nucleic acid sequences of the present invention. Thus, any 45 
suitable method for ei'feeti\el\ generating stable transfor- 
mants linds use in the present invention. 

In additional embodiments of the present invention, the 
first and second expression cassettes are present on a single 
vector. w bile in other embodiments these cassettes are present 50 
on separate vectors. 

In some preferred embodiments, in addition to a promoter 
sequence, the expression cassette also contains a transcription 
termination region downstream of the structural gene to pro- 
vide lor efficient termination. In some embodiments, the ter- "s 
initiation region is obtained from the same gene as the pro- 
moter sequence, while in other embodiments it is obtained 
from another gene. The selection of suitable transcription 
termination signals is well-known to those of skill in the art. 

In addition, it is contemplated that any suitable expression <■ < 
vector will find use in the present invention. Indeed, it is 
contemplated that various conventional vectors used for 
expression in eukaryotic or prokaryotic cells will be suitable 
and find use with the present invention. Standard bacterial 
expression vectors include bacteriophages /.and Vfl Caswell 
as plasmids such as pBR322 based plasmids, pSKF, pET23D. 
and fusion expression s\ stems such as MB P. GST. and LacZ. 



In further embodiments, epitope tags are added to recombi- 
nant proteins, in order to provide convenient methods of 
isolation (e.g.. c-myc). 

Additional elements typically included in expression vec- 
tors are replieons. a gene encoding antibiotic resistance to 
permit selection of bacteria that harbor recombinant plas- 
mids. and unique restriction sites in nonessential regions of 
theplasmidto allow insertion ol heterologous sequences. The 
particular antibiotic resistance gene chosen is not critical, any 
of the many resistance genes know n in the art are suitable. 

Introduction of a Protease Inhibitor-Encoding Nucleic Acid 
Sequence into Host Cells. 

In some preferred embodiments, the methods of the present 
invention provide host cells that contain a stably integrated 
sequence of interest (i.e.. Pi-encoding nucleic acid). How- 
ever, in alternative embodiments, the methods of the present 
invention provide for maintenance of a self-replicating extra- 
chromosomal transformation vector. 

Ihe invention further provides cells and cell compositions 
which have been genetically modified to comprise an exog- 
enously provided Pl-encoding nucleic acid sequence. In some 
embodiments, a parental host cell is genetically modified by 
an expression vector. In some embodiments, the vector is a 
plasmid, while in other embodiments the vector is a viral 
particle, phage, naked DNA. etc. Thus, it is not intended that 
the form of the vector be limited to any particular type of 
vector, as various vectors will find use in the present inven- 

Various methods may be employed for delivering an 
expression vector into cells in vitro. Methods of introducing 
nucleic acids into cells for expression of heterologous nucleic 
acid sequences are also known to the ordinarily skilled arti- 
san, including, but not limited to electroporation; protoplast 
fusion with intact cells: transduction; high velocity bombard- 
ment with DNA-coated microprojectiles; infection with 
modified viral (e.g.. phage) nucleic acids; chemically-medi- 
ated transformation, competence, etc. In addition, in some 
embodiments, heterologous nucleic aeid constructs compris- 
ing a Pi-encoding nucleic acid sequence arc transcribed in 
vitro, and the resulting RNA introduced into the host cell by 
any of the suitable methods known in the art. 

hollowing introduction ofa heterologous nucleic acid con- 
struct comprising the coding sequence for a protease inhibi- 
tor, the genetically modified cells are cultured in conventional 
nutrient media modified as appropriate for activating promot- 
ers, selecting transformants, and/or amplifying expression of 
a PI -encoding nucleic acid sequence. The culture conditions, 
such as temperature, pll and the like, are those previously 
used for the host cell selected for expression, and are apparent 
to those skilled in the art. 

The progeny of cells into which sue!; heterologous nucleic 
acid constructs have been introduced are generally consid- 
ered to comprise the IT-encoding nucleic acid sequence 
found m the heterologous nucleic acid construct. 

Bacterial Hosts and Expression 

Appropriate host cells include any suitable bacterial spe- 
cies. In some embodiments, the bacterial hosts serve both as 
the expression hosts and the source of the first and second 
nucleic acids. Using the present inventive methods and host 
cells, surprising levels of expression have been obtained. The 
system utilized herein has achieved levels of expression and 
secretion of greater than 0.5 gd of protease inhibitor. 

\fterthe expression vector is introduced into the host cells, 
the transfected host cells are cultured under conditions favor- 
ing expression of gene encoding the desired protein. Targe 
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batches of transformed cells can be cultured as described 
above. Finally, product is recovered from the culture using 
techniques known in the art. 

Accessory proteins sue!) as thiol-disulfide oxidoreductases 
or chaperones find use m some embodiments, as they may be K 
beneficial to help fold the secretory protein into its active 
conformation, f hiol-disulslide oxidoreductases and protein 
disulfide isomerases cataly/e the formation of the correct 
disnllide bonds in the protein ( )v erexpression of the hdbl )( ' 
operon m II si, hulls has been shovui to be beneficial for the 10 
production of a protein with disulfide bonds (See e ».. Meima 
et ah, J. Biol. Chem., 277:6994-7001. 1 20021,). Chaperones 
help the secretory protein to fold by binding to exposed 
hydrophobic regions in the unfolded states and preventing 
unfavourable interactions and prolyl-peptidyl cis-trans is 
isomerases assist in formation of the proper conformation of 
the peptide chain adjacent to proline residues. 

In some embodiments of the present invention, the host 
cells are transformed with an expression \eetor encoding at 
least one thiol-disulfide oxidoreductaseorchaperone. It is not - :| 
intended thai the present in\ ention be limited to any particular 
thiol-disulfide oxidoreductase or ehaperone. as any suitable 
thiol-disulfide oxidoreductase or ehaperone known to those 
skilled in the art will find use in the present invention. 

In some embodiments of the present invention, the fraction 25 
of properly folded secretory protein is increased b\ the addi- 
tion of chemicals to the growth medium that reduce/oxidize 
disulfide bonds, and/or alter the general redox potential, and/ 
or chemicals that alter solvent properties thus affecting pro- 
tein conformation and aggregation. In particularly preferred 3 " 
embodiments, a reagent that reduces disulfide bonds, such as 
2-mercaptoethanol, is preferred to increase the fraction of 
correctly folded protein. Howev er, in other embodiments and 
depending on the medium used, other disulfide reducing or 
oxidizing agents (e.g., DTT. TCEP, reduced and oxidized 35 
glutathione, cysteine, cystine, cvsteamine. thiouK colute. 
S 2 0 3 2 -, S,0 4 2 ", S 2 0, 2 -, SO/", S 2 0 7 2 -. Cu+, ect), either 
used alone or in combination, find use in the present inven- 
tion. It is contemplated that other adjuvants that alter solvent 
properties, (e.g.. urea. DMSO. f WfHN k -SO. etc.). either 4u 
added to the growth medium alone or preferably in combina- 
tion with disulfide reducing/oxidizing agents, such as j'Afb. 
will also increase the fraction of correctly folded secretory 
protein and find use in various embodiments of the present 
l. In some preferred embodiments, the fMf is used 
ing from 0.5 to 4 mM, while in other 
embodiments, the concentrations range from 0.1 mM to 10 
mM. Indeed, those of skill m the art know how to select the 
best growth medium and growth conditions to optimize the 
effects of the added thiol reducing/oxidizing agents and/or -" 
other adjuvants, as well as the concentration of thio reducing 
oxidizing agents and'or other adjuvants to use. It is not 
intended that the present in\ ention be limited to any particular 
disulfide reducing, oxidizing agent or adjuvant, as any suit- 
able reagents known to those skilled in the art find use in the 3:1 



for ciilluring bacteria 
weil I now n in the art 



In addition to the carbon and energy source, oxygen, 
assimilable nitrogen, and an inoculum of the microorganism, 
it is necessary to supply suitable amounts in proper propor- 

growth. maximi c i i I I fill in md ener > 

source by the cells in the microbial conversion process, and 
achieve maximum cellular yields w itb maximum cell density 
in the fermentation medium. 

Various culture media find use in the present invention, as 
known to those of skill in the art. 1 lowever. standard bacterial 
culture media find use in the present invention. In some pre- 
ferred media formulations, the media include, in addition to 
nitrogen, suitable amounts of phosphorus, magnesium, cal- 
cium, potassium, sulfur, and sodium, in suitable soluble 
assimilable ionic and combined forms, and also present pref- 
erably should be certain trace elements such as copper, man- 
ganese, molybdenum, zinc. iron, boron, and iodine, and oth- 
ers, again in suitable soluble assimilable form, all as known in 
the art. 

In some embodiments, the fermentation reaction involves 
an aerobic process in w Inch the molecular oxygen needed is 
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assisting the microorganism species to grow in a thriving 
fashion. In effect, by using an oxygenated hydrocarbon sub- 
strate, the oxygen requirement for growth of the microorgan- 
ism is reduced. Nevertheless, molecular oxygen must be sup- 
plied for growth ol aerobic and to a lesser extent, facultative 

Although the aeration rate can vary over a considerable 
range, aeration generally is conducted at a rate which is in the 
range of about 0.5 to 1 0. preferably about 0.5 to 7, volumes (at 
the pressure employed and at 25° C.) of oxygen-containing 
gas per liquid volume in the fermentor per minute. This 
amount is based on air of normal oxygen content being sup- 
plied to the reactor, and in terms of pure oxygen the respective 
ranges would be about 0.1 to 1.7. or preferably about 0.1 to 
1.3, volumes (at the pressure employed mid at 25° C.) of 
oxygen per liquid volume in the fermentor per minute. 

The pressure employed for the microbial conversion pro- 
cess can range widely. Pressures generally are within the 
range of about 0 to 50 psig. presently preferably about 0 to 30 
psig. more pre fen. ly at I - slightly ov er atmospheric pres- 
sure, as a balance of equipment and operating cost versus 
oxygen solubility achieved, (neater than atmospheric pres- 
sures are advantageous in that such pressures do tend to 
increase a dissolved oxygen concentration in the aqueous 
ferment, which in turn can help increase cellular growth rates. 
At the same time, this is balanced by the fact that high atmo- 
spheric pressures do increase equipment and operating costs. 

The fermentation temperature can vary somewhat, but for 
most bacterial species used in the present invention, the tem- 
perature generally will be within the range of about 20 C. to 
40° C, generally preferably m the range of about 28° C. to 37° 
( '.. depending on the strain of microorganism chosen, as 
known to those skilled in the art. 

'fhe microorganisms also require a source of assimilable 
nitrogen. The source of assimilable nitrogen can be any nitro- 
gen-containing compound or compounds capable of releas- 
ing nitrogen in a form suitable for metabolic utilization by the 
microorganism. While a variety of organic nitrogen source- 
compounds, such as protein hydrolysates. can be employed, 
usually cheap nitrogen-containing compounds such as 
ammonia, amnion mi h Iroxidc ir nd various ammo- 
nium salts such as ammonium phosphate, ammonium sulfate. 
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ammonium pyrophosphate, ammonium chlondc. or various 
other ammonium compounds can be utilized. Ammonia gas 
itself is convenient lor large scale operations, and can be 
employed by bubbling thr< >ugh the aqueous ferment (fermen- 
tation medium) in suitable amounts. At the same time, such 5 
ammonia can also be employed to assist in pH control. 

The pi I range in the aqueous microbial ferment (fermen- 
tation admixture) should be in the exemplary range of about 
2.0 to 8.0. However. pH range optima for certain microorgan- 
isms are dependent on the media employed ti . some extent, as 
well as the particular microorganism, and thus change some- 
what with change in media as known to those skilled in the art. 

While the average retention time of the fermentation 
admixture in the fermentor can vary considerably, depending 1 5 
in part on the fermentation temperature and culture 
employed, as known in the art. 

In some embodiments, the fermentation is preferably con- 
ducted in such a milliner that the carhon-containing substrate 
can be controlled as a limiting factor, thereby providing good 
conversion of the carbon-containing substrate to cells and 
avoiding contamination of the cells with a substantial amount 
of unconverted substrate. The latter is not a problem with 
water-soluble substrates, since any remaining traces are , 5 
readi ly removed. It may be a problem, however, in the case of 
non-water-soluble substrates, and require added product- 
treatment steps such as suitable washing steps. The time 
needed to reach this limiting substrate level is not critical and 
may van, with the particular microorganism and fermentation ; 
process being conducted. However, it is well known in the art 
how to determine the carbon source concentration in the 
fermentation medium and whether or not the desired level of 
carbon source has been achieved. 

Although m some embodiments, the fermentation is con- 
ducted as a batch or continuous operation, fed batch operation 
is generally preferred for ease of control, production of uni- 
form quantities of products, and most economical uses oi all 
equipment. 40 

If desired, part or all of the carbon and energy source 
material and or part ol'the assimilable nitrogen source such as 
ammonia can be added to the aqueous mineral medium prior 
to feeding the aqueous mineral medium into the fermentor. 
Indeed, each o I' the streams introduced into the reactor pref- 45 
erabK is controlled at a predetermined rate, or in response to 
a need determinable by monitoring such as concentration of 
the carbon and energy substrate, pi I. dissolved oxygen, oxy- 
gen orearbon dioxide in tbeoff-gases from the fermentor. cell 
density measurable by light transnnttancy, or the like. The 50 
teed rates ol'the various materials can be varied so as to obtain 
as rapid a cell gn >vv th rale as possible, consistent w nil efficient 
utilization ol'the carbon and energy source, to obtain as high 
a yield of microorganism cells relative to substrate charge as 
possible, but more importantly to obtain the highest produc- S s 
tion of the desired protein per unit volume. 

In either a batch, or the preferred fed batch i iperation, all 
equipment, reactor, or fermentation means, vessel or con- 
tainer, piping, attendant circulating or cooling devices, and 
the like, are initially sterilized, usually b\ employing steam i.u 
such as at about 121° C. for at least about 15 minutes. The 
sterilized reactor then is inoculated with a culture of the 
selected microorganism in the presence of all the required 
nutrients, including oxygen, and the carbon-containing sub- 
strate. 1'be type of fermentor employed is not critical, though c<s 
in some embodiments, the 151 Biolafitte (Saint -< rermain-eu- 
I aye. France) is preferred. 
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Protein Separations 

In preferred embodiments, once the desired protein is 
expressed, the secreted protein is recovered. The present 
invention provides methods of separating a desired protein 
from its fusion analog. It is specifically contemplated that the 
methods described herein will find use in the separation of 
proteinase inhibitor and variants from the fusion analog. 

The collection and purification of the desired protein from 
the fermentation broth can also be achieved using procedures 
know n to those of skill in the art The fermentation broth will 
generally contain cellular debris, including cells, various sus- 
pended solids and other biomass contaminants, as well as the 
desired protein product, which are preferably removed from 
the fermentation broth by means known in the art. Suitable 
processes for such removal include conventional solid-liquid 
separation techniques (e.g.. centrifugation. filtration, dialy- 
sis, microliltration. rotary vacuum filtration, or other known 
processes), to produce a cell-free filtrate. In some embodi- 
ments, it is preferable to further concentrate the fermentation 
broth or the cell-free filtrate prior to the purification and/or 
crystallization process using techniques such as ultrafiltra- 
tion, evaporation and/or precipitation. 

Precipitating the proteinaeeous components of the super- 
natant or filtrate may be accomplished by means of a salt (e.g., 
ammonium sulfate) or low : pH (typically less than 3), fol- 
lowed by purification by a variety of chromatographic proce- 
dures (e.g.. ion exchange chromatography, affinity chroma- 
tography, hydrophobic interaction chromatography, 
hydrophobic charge induction chromatography etc.) or simi- 
lar art recognized procedures. It is not intended that the 
present invention be limited to any particular separation 
method, as it is contemplated that any method will find use in 
the present invention. 

In certain preferred embodiments, when the expressed 
desired polypeptide is secreted from the bacterial cells, the 
polypeptide is purified from the growth media. In preferred 
embodiments, the expression host cells are removed from the 
media before purification ol'the poh peptide (e.g. by centrifu- 

When the expressed recombinant desired polypeptide is 
not secreted from the host cell, the host cell is preferably 
disrupted and the polypeptide released into an aqueous 
"extract" which is the first stage of purification. Preferably, 
the expression host cells are collected from the media before 
the cell disruption e g. by centrifugation i 1 'he cell disruption 
may be performed by using any suitable means known in the 
art. such as by lyso/vme or beta-glucanase digestion or by 
forcing the cells through high pressure (See e.g., Scobes, 
I'mtein I'urijn alion. Second edition. Springer- Verlag). 

In some embodiments, the addition of'six histidine residues 
(i.e.. a "His Tag") to the i -terminus is used as an aid in the 
purification of'the desired protein and its fusion analog. Use 
of the His tags as a purification aid is well known in the art 
(See e.g.. Hengen, TIBS 20:285-286 [19951). The 6x his- 
tagged proteins are easily purified using Immobilized Metal 
ion Aflity Chron I ph (IMA< is known to those 
skilled in the art. 

Purity 

For some applications, it is of great importance that the 
protease inhibitors produced using the present invention be 
very highly pure (e.g.. having a purity of more than 99%). 
This is particularb, true whenever the desired protein is to be 
used as a therapeutic, but is also necessary tor oilier applica- 
tions. The methods described herein provide a way of pro- 
ducing substantially pure desired proteins. The desired pro- 
teins described herein are useful in pharmaceutical and 
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personal care compositions. ! lowev or. H is contemplated th.it 
proteins of varying purity levels will be produced using the 
methods of the present invention and it is not intended that the 
proteins produced using the present invention be limited to 
any particular level of purity. 5 

Activation ol Bl i\ I )uring Purification 

In some embodiments of the present invention, after 
growth during the purification process, the activity of the 
protein is increased by tiie addition of chemicals that reduce/ I0 
oxidi/e disulfide bonds and/or alter the general redox poten- 
tial, and/or chemicals that alter solvent properties thus affect- 
ing protein conformation and aggregation. In some 
particularly preferred embodiments, addition of a reagent that 
reduces disulfide bonds, such as 2-mercuptoethanol. is used , 
to increase activity of the protein. 1 lovvever. as those skilled in 
the art appreciate, depending purity and buffer composition, 
other disulfide reducing or oxidizing agenls ie.g.. DTT, 
FCEP. reduced and oxidized glutathione, cysteine, evstme. 
cysteamine. thioglyeolate. S,0, 3 -, S,0 4 2 ", SjO, 3 ", S0 3 2 ", , () 
S,0 7 2 ~, Cu+, protein disulfide isomerases. protein thiol-dis- 
ulfide oxidoreduclnses. etc. I. either used alone or in combi- 
nation, find use in the present invention. < )ther adjuvants that 
alter solvent properties, (e.g. ethanolamine. DMSO, 
TWFFNR-80, arginine. urea, etc.). either added alone or 2$ 
[ireterably in combination with disulfide reducing-oxidizing 
agents, suchas (5ME, during the purification process also find 
use in the present invention by increasing the activity of the 
protein. In certain preferred embodiments, partially purified 
prolem is diluted in buffer (in some particularly preferred 30 
embodiments, a zwitterionic buffer with TWEEN*-80 at 
basic pH) and activated with bME and a disulfide oxidizing 
agent (in alternative preferred embodiments, oxidized glu- 
tathione or sodium sulfite). 

In addition, it is contemplated that conditions will be ^ 
screened in order to determine the optimal activation of the 
protein, if desired. For example, various |!Mf concentrations 
(0. 1-10 mM ). oxidizing agent concentrations ( 0 to 1 20 to 20 
times the |]ME concentration) pH (7.5-9.5), temperamres 
(15-40° C), dilutions (1-20 fold), incubation times (12-72 h). 40 
aeration (incubations under inert gas to vigorous mixing 
under oxygen containing gases), buffer types ( I ris. CHES, 
CAPS,Tricine. TAPS, other zwitterionic buffers, etc.), buffer 
concentrations (0.1-1 M), and the addition of various adju- 
vants known to alter solvent properties ihereby affecting pro- 4 ^ 
tein confirmation and aggregation (e.g.. ethanolamine, 
DMSO, TWF FN R -80. argi mne. urea. etc. fare tested in order 
to determine the optimum conditions for the expression sys- 
tem used. It is not intended that the present invention be 
limited to any particular disulfide reducing oxidizing agent. s , 
dilution, temperature, pi I, buffer type or composition, or 
adjuvant, as any suitable reagents known to those skilled m 
the art find use in the present 
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The present invention is described in further detail in the 
following Examples winch are not in any way intended to 
limit the scope of the invention as claimed. The attached 
Figures are meant to be considered as integral parts of the t 
specification and description of the invention. All references 
cited are herein specifically incorporated by reference for all 
that is described therein, f lie follow ing examples arc offered 
to illustrate, but not to limit the claimed invention In the 
experimental disclosure which follows, the following abbre- <• 
v unions apply PI (proteinase inhibitor). BBI (Bowman-Birk 
inhibitor), STI (So\ bean Trypsin inhibitor): VEGF and VegF 



(vascular endothelial grow th factor); ppm (parts per million); 
M (molar): mM (millimolar): uM (micromolar); nM (nano- 
molar); mol (moles); mmol (millimoles): umol (micro- 
moles); nmol (nanomoles); gm (grams): mg (milligrams): ug 
(micrograms); pg tpicograms): 1 ( liters); ml and ml. (milli- 
liters): id and ill (microliters): em (centimeters): mm (milli- 
meters); nm (micrometers); nm (nanometers); U (units); V 
(volts); MW (molecular weight): sec (seconds); min(s) 
(minute/minutes): h(s) and hps) (hour hours): " ('. (degrees 
Centigrade); QS (quantity sufficient); ND (not done); NA 
(not applicable ): rpm (revolutions per minute); Iff) (water): 
dl !,( ) (deionized water); (I K d (hydrochloric acid), aa (amino 
acid); bp (base pair); kb (kilobase pair): kD (kilodaltons); 
cDNA (copy or complimentary DNA): DNA (deoxyribo- 
nucleic acid); ssDNA (single stranded DNA): dsDNA 
(double stranded DNA): dN I P (deoxyribonucleolide triph- 
osphate); RNA (ribonucleic acid): Mg( "|. (magnesium chlo- 
ride); NaCl (sodium chloride); w/v (weight to volume); v/v 
(volume to volume): g igrav ity ). Of) (optical density); Dul- 
beceo's phosphate buffered solution (DPBS); SOC (2% 
Bacto-Tryptone. 0.5% Bacto Yeast Extract, 10 mM NaCl, 2.5 
mM KCI): Terrific Broth (TB; 12 g/1 Bacto Trvptone, 24 g/1 
glycerol, 2.31 g/1 KH 2 P0 4 . and 12.54 g/1 K,HP0 4 ); OD 280 
(optical density at 280 nm): OD 600 (optical density at 600 
nm); A 405 (absorbance at 405 nm): Vmax (the maximum 
initial velocity of an enzyme catalyzed reaction); PAGE 
(polyacrylamide gel electrophoresis I; PBS (phosphate buff- 
ered saline 1 1 50 mM NaCl. 1 0 mM sodium phosphate buffer, 
pH 7.2]); PBST (PBS+O.25% TWEEN® 20); PEG (polyeth- 
ylene glycol); PCR (polymerase chain reaction); RT-PCR 
(reverse transcription PCR); SDS (sodium dodecyl sulfate); 
Tris (tris(hydroxymethyl)aminomethane); HEPES (N-[2- 
HydroxyethylJpiperazine-N-|2-ethanesulfonic acid)); HBS 
(HEPES buffered saline); SDS (sodium dodecylsulfate); 
bME, BME and fsME (beta-mercaptoethanol or 2-mercapto- 
ethanol); Tris-HCl (tris[Hydroxymethyl]aminometliane-hy- 
drochloride); Tricine (N-[tris-(hydroxymethyl)-methyl]-gly- 
cine); C! 11 -S (2-(N-cyclo-hexylamino 1 ethane-sul Ionic acid); 
1APS (.i-{[tris-(hydroxyniethyl)-methvl|-amino}-propanc- 
sulfonic acid); CAPS (3-(cyclo-hexyIamino)-propane-sul- 
lomc acid; DMSO (dimethyl sulfoxide); DTT (1.4-dilhio- 
Dl.-threitol); Cilut and GSH (reduced glutathione); GSSG 
(oxidized glutathione); TCI P (Tris|2-earboxyethyl] phos- 
phine): Ci (Curies) md imilhCuries); uCi (microCuries); 
TLC (thin layer achromatography): Is (losyl); Bn (benzyl); 
Ph (phenyl): Ms (mesyl ): Ft ( ethyl). Me (methyl); Taq (Ther- 
mits aquaticus \ )NA polymerase); Klenow (DNA polymerase 
1 large (Klenow-) fragment I: rpm (revolutions per minute): 
EGTA (ethylene glycol-bis(|i-aminoethyl ether) N,N.N\N'- 
tetraacetic acid): EDTA (ethylenediaminetetracetic acid); bla 
^-lactamase or ampicillin-resistaiice gene), (iF Healthcare 
((iE. Healthcare. Chalfont St. Cities. United Kingdom); 
DNA2.0 (DNA2.0. Menlo Park, Calif); OXOID (Oxoid, 
Basingstoke, Hampshire, UK); Megazyme (Megazyme Inter- 
national Ireland Ltd.. Bray Business Park. Bray. Co., Wick- 
low. Ireland): Coming (Corning I lie Sciences. Corning. 
N.Y.); (NEN (NI'N 1 lie Science Products. Boston, Mass.): 
Pharma AS (Pharma AS, Oslo, Norway); Dynal (Dynal, Oslo, 
Norway); Bio-Synthesis (Bio-Synthesis. Lewisville, Tex.); 
ATCC (American Tvpe Culture Collection. Rockville, Md.); 
Gibco/BRL (Gibco/BRL, Grand Island, N.Y.): Sigma (Sigma 
Chemical Co.. St. Louts. Mo.); Pharmacia (Pharmacia Bio- 
tech. Pisacataway. N.J.): NCBI (National Center for Biotech- 
nology Information): Applied Biosystems (Applied Biosvs- 
tems, Foster City. Calif); Clontech (CLONTECH 
Laboratories, Palo Alto, Calif.); Operon Technologies (Op- 
eron Technologies, Inc., Alameda, Calif.); MWG Biotech 
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(MW() Biotech, High Point, NX'.); Oligos Ftc (Oligos Etc. 
Inc, Wilsonville, ( >reg. r, Bnchem i Bachcm Bioscience, Inc., 
King of Prussia. Pa,): Dileo ( Ditto Laboratories, Detroit, 
Mich.); Mediated) (Mediulech. I lerndon, Va.; Santa Cruz 
(Santa Orti/ Bi It i Inc.. Santa Cruz. ( "alii, i: Bio V~ 

eris (BioVeris Corp.. Gaithersbnrg, Md. ); Oxoid (Oxoid Inc., 
Ogdensburg, N.Y.); Worthington I Worthington Biochemical 
Corp., Freehold, N.J.); GIBCO BRL or Gibco BRL (Life 
Technologies, Inc.. ( .iaithersburg. Md.); Millipore(Millipore, 
Billenca, Mass.); Bio-Rnd (Bio-Rad, 1 iercules, Calif.): Invit- ; 
rogen (Invitrogen Corp., San Diego. Calif); NFB (New- 
England Biolabs, Beverly, Mass.); Sigma (Sigma Chemical 
Co., St. Louis, Mo.); Pierce (Pierce Biotechnology, Rock- 
ford. 111. ): i'akura (Takara Bio Inc. Otsti, Japan); Roche (Hoff- 
mann-La Roche. Basel. Sw it/erland): KM Science (KM Sci- i 
ence, Gibbstown, N.J.): Qiagen (Qiagen, Inc., Valencia, 
Calif.); Biodesign (Biodesign Intl., Saco, Me.); Aptagen 
(Aptagen, Inc., Ilerndon, Va.): Molecular Devices (Molecu- 
lar Devices. Corp.. Sunnyvale. Calif): R&I) Systems (R&D 
Systems. Minneapolis. Minn.); Slralagene (Stratagene Clon- 2 
ing Systems, La Jolla, Calif); Marsh (Marsh Biosciences, 
Rochester, N.Y.): Bio-Tek (Bio-Tek Instruments. Winooski, 
VT); (Biacore (Biacore. Inc., Piscataway, N.J.); PeproTech 
(PeproTech. Rocky Hill, N.J.); SynPep (SynPep, Dublin, 
( 'alif. ): and Microsoft (Microsoft, Inc.. Redmond, Wash.). 2 

Example 1 



The protein sequence of the major n 
coded by the above synthetic gene is: 



CPCCDQCACTKSNPPQCRCSDMRLNSCHSACKSCICALSVPAvCi. 



The PCR primers used to amplify the BBI gene for fusion 
o the BCE103 cellulase expression cassette in the pJKB 



BBIHxndlll.RV: 



' CTGCAGAAGCTTAAAAATAAAAAAACGGA 



TTTCCTTCAGGAAATCCGTCCTCTGTTAACTT 



TTAGTTCTCTTTATCGTCCTCGC 3 



In this Example, experiments conducted to produce 
BCE103-BBI fusion proteins in B. subtilis are described. The 
DNA sequence of the synthetic gene (Operon Technologies) 
coding lor the pro-BBI protein w ith a < '-terminal hexa-histi- 3 
dine lag used in these experiments is: 



CAGCAATGACGATQAGAGCTCTAAACCCTGTTGCGATCAATGCGCATGTA 



TGTCATAGTGCATGCAAAAGCTGTATCTGCGCCCTGAGTTATCCAGCTCA 



atattittgcgtcgaaatcacgaact-i 



3CTATGAGCCATGTAAACCAA 



GCGAGGACGATAAAGAGAACCATCATCACCATCACCAT 



The protein sequence of pro-BBI with a ( '-terminal hexa- 
histidine lagged coded for by the abo\ e synthetic gene is: 



\U L , — I L I t ILLr " H. 1 TI lir- L It _ 

ICALSYPAQCFCTDITDFCYEPCKPSEDDKENHHHHHH 



fhe portion ol'llie I )NA sequence of the synthetic gene that 
codes for the major mature form of BBI is: 



" ^GTGTCGGTGTTCCGATATGCGTCTGAATAGCTGTCATA 



BBIHIS-Hindlll RV : 5" CTGCAGAAGCTTTAAAAA TAAAAAAACGGA 

) 

TTTCCTTCAGGAAATCCGTCCTCTGTTAACTT 
TTAATGGTGATGGTGATGATGGTTCTC 3 ' 

The sequence of the aprE-BCE103-BBI-IIisTag expres- 
' sion cassette (EcoRI-IIindlll) that was cloned into the 
p.JM103 integration sector is provided in FIG. 1. A schematic 
plasmid map of the pJMl 03BBIHis expression vector is pro- 
vided in FIG. 2. 
The alkaline cellulase (BCE103) gene (See, van Soligen, 
40 U.S. Pat. No. 6.063.61 1. hereby incorporated by reference) 
fused to the B. subtilis aprE promoter and signal sequence, 
was cloned Irom P ( CAPRI 03 (Shaw et al„ J. Mol. Biol.. 
320:303-309 [2002]) as an EcoRI-BamHI fragment (i.e., a 
fragment that carries the coding sequence of the BCF.103 
i> catalytic domain and first cellulose binding domain linker 
only) into p.l VI 103 ( Perego. "Inlegrational vectors for genetic 
manipulation in Bacillus subtilis" In, Bacillus subtilis and 
Other Gram-positive Bacteria: Biochemistry. Physiology, 
and Molecular Cenetics, Sonenshein. I loch, and Losick 
(eds). American Society for Microbiolog) , Washington D.C , 
pp. 615-624 [ 1 W3|). \ gene encoding the soybean Bowman- 
Birk protease inhibitor (BBI) (Swiss-Prot Accession # 
P01055; See, Odani and Ikenaka. J. Biochem., 71: 839-848 
[1972]) with a C-terminal hexa-histidine tag (His-Tag) was 
svnthesi/ed bv Operon Technologies (See. DNA sequence 
ibove).'IheBBIgenewasampliliedl PCI ith primers (all 
primers were synthesized by MWd Biotech. ( Migos Ftc, or 
Operon Technologies) that generated a 5' BamHI site in the 
correct reading frame w ith tile BOF103 gene, and at the 3' end 
n u ill i i i 1 1 I t i thi 

' H i Uli it' vc mylas no) alter tin id hi 

BBI gene with a 3' Hindlll site for cloning into the pJM103 

1 i« i- ts with or without a C-t nun, il His lag were 
i in i im rs BBI s i W i Q ID NO: 14) 
65 and BBIHISHindllI_RV (SEQ ID NO: 16), or BBIfu- 
sion_FW (SEQ ID NO: 14) and BBI-HindIII_RV(SEQ ID 
N( ): 1 5 ), respectively, using the synthetic BBI gene as a tern- 
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plate. I luiess indicated oiherw lse. PGR reactions were typi- 
cally performed on a thermocycler for 30 cycles with High 
Fidelity Platinum Tuq polymerase ( Invitrogen) according to 
the instructions of the supplier (with an annealing tempera- 
ture of 55° C). The PCR fragments were cloned as BamHI- , 
llindlll fragments into pJM103 earning the aprE-BCE 103 
expression cassette. The correct gene sequence was verified 
by DNA sequencing. 

Thus, as shown in FIG. 1. the N-terminus of the mature 
coding region of the BBI gene (with urn ithont the i lis- lag) ( 
was fused in frame to the C -terminal coding region of the first 
CBD (cellulose binding domain) linker sequence coded by 
the BCE 103 cellulase gene. Thereby, the two CBD's of 
B( IT 03 (Shaw el al.. supra ) are replaced by BBI in the final 
expression vectors. pJM103BBI or p,IM103BBIhis (See, 
FIG. 2). The aprli promoter controls the expression of the 15 
BCF103-BBI gene fusions iSee. Ferrari et al.. J. Bact., 170: 
289-295 |1988|; and Henner et al.. J. Bact.. 170: 296-300 
[1988]). 

Compelenl liticllus suhalis cells. BG3934comK (de- 
gU" v 32, oppA, AspoIIE3501. AaprE, AnprE, Aepr, AispA, 20 
Abpr. amyE: :xylRPxy lAcomK-phleo). were transformed 
with the expression plasmids, p.IM103BBl or 
pJM l()3BBIhis. The bacteria were made competent by the 
induction of the comK gene under control of a xylose induc- 
ible promoter (Halm et al.. Mol. Microbiol., 21:763-775 2 5 
1 1996|). The trans Ibrmants were selected on Luria Brothagar 
(1 A) plates containing 5 ug ml chloramphenicol, lb increase 
the expression by gene amplification, colonies were streaked 
and grown several times on LA plates with 25 ug/ml chloram- 
phenicol until die grow 111 rale u uh lire anlibiolie was similar , 
to growth rate in the absence of chloramphenicol. I he 
BCE103-BBI fusion protein was produced by growth in 
shake flasks at 37° C. in TSB medium (Tryptone Soya Broth 
from OXOID. 30 g/L) or in MBD medium, a MOPS based 
defined medium. MBD medium was made essentially as 
described (Neidhardt et al., J. Bacterid., 119: 736-747 1S 
[1974]). except NH 4 CF. FeS() 4 . and CaCE were left out of 
the base medium. 3 niM K HPO, was used, and the base 
medium was supplemented with 60 mM urea. 75 g/I , glucose, 
and 1% sovlone. Also, the micronulrienls were made up as a 
lOOx stock containing in one liter. 4(X) me i eS() 4 -I CO. 100 i' 
me MnS( ) ,.1 1 .(), 100 mg /nS(.) 4 "1 1 ,( ). 50 me ( 'u( F 21 1 ( ). 
100 nig (bfFMl.O. 100 ma NaMo() 4 .2II,0. 100 mg 
Na,B 4 O 7 .10H 2 O, 10 ml of 1M CaCl,, and 10"ml of 0.5 M 
sodium citrate. 

B( 'F103-BB1 fusion protein could be easily visualized in 45 
samples from cell free supernatants (after 24 h of growth in 
FSB medium or 48 li in MBF> medium) as the majorprotein 
baud on SDS-PAGF' gels { 10°o NtiPAGF m Ml S buffer, run 
as described by the manufacturer. In\ itrogen) running at -44 
kDa bv usinu. standard protein stains ( c » t iel( "ode Blue Slain M , 
Reagent; Pierce), 'fhe identity of the BCP10.-BBI fusion 
protein was verified by immunoblots of SDS-PAGF gels 
using the protocols supplied by the manufacturer (BM Cliro- 
mogenic Western Blotting Kit: Roche \pplied Science using 
an anli-l IisTag antibody or an anti-BCF103 cellulase poly- 
clonal antibody for detection). 

To determine the BCE103 activity, cellulase degradation 
was assessed qualitatively on FA cellulase indicator plates 
(with l"o earboxymelln lcellulose stained with 0.2% Congo 
Red. or w ith 0.5" » a/o-t M-eellulose, Megazyme), or quan- 
titatively by a direct assay in \ssay Buffer (100 mMTris pH 60 
8.(i. 0.005% Tw een-80) on the culture broth using a the syn- 
thetic substrate. 4-nitrophenyl ('-D-eellobioside (Sigma), 
using methods known in the art (See e.g., van Tilbeurghet al., 
Meth. Enzymol.. 160:45-59 [1988]). 

i | in in libiton ssays e perfon 1 in As t\ Rul ei <« 
to determine the BBI aethity. Specifically, a standard curve 
was generated by making eleven 1 1 serial dilutions (100 uL 



BBI+100 ul, Assay Buffer) of a 2 ug/mL standard BBI solu- 
tion. The BBI standard was purified from a 1 mg/ml Trypsin- 
Chymotrypsm Inhibitor (Sigma Cat. tf'f-9777) solution in 20 
mM MES pH 6.0 usinu a hvdrophobic interaction column 
(POROS HP2, Phenyl column, Applied Biosy stems). The 
column was equilibrated with 20 mM MES pH 6.0, loaded 
with 5 mg of the inhibitor, washed with the equilibration 
buffer, and then the BBI waselutedw it h water Unknown BBI 
samples to be tested in the inhibitory assay were diluted as 
necessary, so that two or more data points would fall within 
the standard curve (usually 1:10, 1:100, 1:200, 1:1000, 
1:2000 sample dilutions were tested and then the dilutions 
fine tuned if necessary ). I aeh diluted BBI standard or sample, 
20 uL, was added to 80 ul of 50 ng ml bovine pancreatic 
trypsin ( Worlhmgton) (made by diluting a slock 1 mg/ml. 
trypsin solution into Assay Butter). For convenience, the 
standards and samples were arrayed in 96 well microliter 
plates. The reactions were mixed and incubated 1 5 min at 25° 
C. After die incubation. 100 itl of the 0.5 mg/ml trypsin 
substrate (diluted in Assav Buffer from a 1 00 mg/ml solution 
in DMSO), Suc-AAPR-pNA (succinyl-Ala-Ala-Pro-Arg- 
para-nitroanihde. Bachem). was added, mixed and the OD 
(A 405 ) was monitored for 1 5 min. with 1 time point recorded 
every 12 sec using a Spectra Max 250 (Molecular Devices). 
The data points were used to determine the Vmax for each 
reaction. The standard curve was generated by plotting Vmax 
versus BBI concentration and was fitted to a four-parameter 
curve. All data lining was done using software supplied by the 
manufacturer | Molecular I )evices).The BBI concentration of 
the unknown samples was calculated from the standard curve. 
\ltcrnati\ely. the BBI activity was measured using the same 
protocol but by determining bo\ inc pancreatic chymotrypsin 
I Worthington I inhibition (chymotrypsin was used at the same 
concentration as U'vpsm and ch\ motrvpsm aelw it v was mea- 
sured by adding 100 uL of a 0.4 mg/ml chymotrypsin sub- 
strate. succmvl-Ala-Ala-Pro-Phe-para-nitroanilide, 
Bachem). 

'Piters from sliake flask runs ( 500 ml MBD medium in 2.8 
L Fernbach 6 baffled fiasks, 37° C, 225 rpm. harvested 60 h 
after of growth) typically ranged from 0.4-0.9 mg/ml BCE 
activity and 40-150 ug ml BBI trypsin inhibitory activity. 
However, it is contemplated that titers likely could be 
improved further by optimizing the bacterial strain, culture 
-■dium and growth conditions i aeration, temperature, time 



ol liar 
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ln addition to the BCE 103 fusion to wild-type BBI, fusion 
proteins to BBI variants and fusion proteins with various 
linkers between BCE103 and BB] were produced using the 
methods outlined above, as described in the following 
Examples. In addition, fusion proteins were also produced 
w hen the BBI was fused to the 2"% BI) linker i Bl T-cbdD- 
BBI; See, F.xample4) making it possible to use the 1" CBD to 
a d m iht pi rilic ilion process 

Example 2 

Production of Peptides Substituted into the BBI 
Reactive Site l oops as BCF103-BBI Fusion Proteins 

In this Example, experiments conducted to produce pep- 
tides substituted into the BBI reactive site loops as BCE 103- 
BBI fusion proteins are described. The primers, as well as 
other sequences used in the various steps of these experiments 
are provided below. The sequence of 12BBlck81 from the 
BCE103 fusion site (at the BamHII) to the end of the gene is 
provided in FIG. 3. The CK37281 peptide sequences 
(ACYNLYGWTC (SEQ ID NO:9) are inserted into both the 
trypsin and chyniotry psin inhibitory loops. 

Fhe primers used to introduce an EcoRI site in the BBI 
gene using QuikChangeR site-directed mutagenesis (Strat- 
agene) were: 
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' -GATATGCGTCTGAATTCCTGTCATAGTGCAT ( 



' -ATGCACTATGACAGGAATTCAGACGCATATC (SEQ I 



5:18) 



l"hc sequences of the 1 )NA oligonucleotides that were 
annealed and cloned in the BBI gene ( Sacl-i ieoRI) to replace 
the trypsin inhibitory loop with the V'egF binding peptide , 

; ' i vere 



ACGACCACAGCTGCATTGATCGCAACAGGGTTTAGAGCT 



TGTCGCTGCAGCGATATGCGTCTG 



ACTTGTCGCTGCAGCGATATGCGT 



TAACATGCGCATTGATCGCAACAGGGTTTAGAGCT 

The sequences of the DNA oligonucleotides that were 
annealed and cloned in the BBI gene (EcoRI-Sall) to replace 
the chymotrypsin inhibitory loop with the V'egF binding pep- 
tide ( '107281 were: 



5 ' -AATTCCTGTCATAGTGCCTGCAAAAGGTGCGCATGTTATAACCTGTA 
CGGGTGGACCTGTTTTTGCG 



TTTGCAGGCACTATGACAGG 

3 

The DNA sequences of Ihe oligonucleotide pmrs used to 
make cassettes to introduce peptides into the trypsin (Sad and 
EcoRI restriction sites) or chymotypsin (EcoRl and Sail 
restriction sites) reactive site loops of the synthetic BBI gene 
are provided below. These peptide coding sequences were <f 
then moved into the p2JMKBBBI expression vector as Sacl- 
Sall fragments. 



AATTCAGACGCATATCGCTGCAGCGACATTCGTCCAGGTTAGAACGAGC 
ACAACCGCATTGATCGCAACAGGGTTTAGAGCT 

(SEQ ID 110:31) 
CTAAACCCTGTTGCGATCAATGCGGTTGTCAGCGTGCTCTGCCGATCCTG 



AATTCAGACGCATATCGCTGCAGCGACACAGGATCGGCAGAGCACGCTGA 
CAACCGCATTGATCGCAACAGGGTAGAGCT 

) P (SEQ ID 110:33) 

CTAAACCCTGTTGCGATCAATGCCAGTGTGGTCGTCTGCACATGAAAACC 

TGT 



AATTCAGACGCATATCGCTGCAGGGAC 



CACTGGCATTGATCGCAACAGGGTTTAGAGCT 



AGCTCAATGTTTTTGCG 



CACCGTTGTCGCTGCAGCGATATGCGTCTG 



GAACAACACATGCGATTGATCGCAACAGGGTTTAGAGCT 



TCGACGCAAAAACAACGGTGGTGACCCCAGTCCTGAACAACGCAGATAC 
AGCTTTTGCATG 



CTAAACCCTGTTGCGATCAATGCAGCTGTGGTCGTAAAATCCCGATCCA 



5 AATTCAGACGCATATCGCTGCAGCGACACTGTGGAGGGGTTGAGTACGTA 
CAGTTGCATTGATCGCAACAGGGTTTAGAGCT 

The DMA sequences ol oligonucleotide primer pairs used 
to introduce peptide sequence-- into the trypsin or chvmot- 

j rypsin reactive site loops using a QuikChange® II XL site- 
directed mutagenesis kit (Stratagene) are provided below. 
The reactions were performed as outlined by the manufac- 
turer and described in this Example. Twenty cycles were 
performed with extensions ol 6 minutes at 68° C, denatur- 

5 ations of 50 s at 95° C, and annealings at 55° C. for 50 s. After 
the cycles, a final extension was performed at 68° C, for 20 
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CGCAAAAACAGCCACGAGATTTTGAGC ,T"1 " , .1 ; 'TTTTGCATG 



CGCAAAAACATGTTGTCATTTGATTATACCAGCAGATACAGCTTTTGCATG 



CGCAAAAACATGAATTCGGGCCAAGTTGATGGCAGATACAGCTTTTGGATG 



CGCAAAAACAAGAATACGGTGCCCACGGATGGCAGATACAGCTTTTGCATG 



CGCAAAAACAGTGTTGAAGATAATGAAGATTGCAGATACAGCTTTTGCATG 



CGC.^A-/J-/JA'';J";:;CGAT~A;-.GAGACr,GTGTG':'AGATACAGCTTTTGC.-,T3 



CTGTATC'TGCGTTACAACATCAATGGGCATGTGTTTTTGCGTCGACATCAC 



CGCAAAAACACATGCCCATTGATGTTGTAACGCAGATACAGCTTTTGCATG 



CGCAAAAACACCAATCATACGGTGATGCGCGGCAGATACAGCTTTTGCATG 



CGCAAAAACATGTT B GCTTTTGCATG 



CTGTATCTGCTC - - - ' 



CGCAAAAACATGTAGAAAGCGC3AAAGCCTGAGCAGATACAGCTTTTGCATG 
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and 

CGGAACACCGGCAAAGCATAGTCCAGATCTCTTCACATGCGCATTGATCGCAACAGG 
CAAAAGCTGTGCTTGTGAAGAGATCTGGACTATGCTTTGCTTTTGCGTCGACATCACGG 

ACGCAAAAGCAAAGCATAGTCCAGATCTCTTCACAAGCACAGCTTTTGCATGCACTATG 

1C2 (1" loop) 

TCAATGCGCATGTTGGGCCCTTTTACTGTCAAAACATGCCGGTGTTCCGATATGCGTC 
CGGAACACCGGCATGTTTTGACAGTAAGGGCCCAACATGCGCATTGATCGCAACAGG 

1C2 <2" d loop) ^ GACATCACGG 

and 

ACGCAAAAGCATGTTTTGACAGTAAGGGCCCAACAAGCACAGCTTTTGCATGCACTATG 
2E2 (l" loop) 

TCAATGCGCATGTCTTACAGTACTGTGGACTACATGCCGGTGTTCCGATATGCGTC 

CGGAACACCGGCATGTAGTCCACAGTACTGTAAGACATGCGCATTGATCGCAACAGG 
2E2 (2 nd loop) 

CAAAAGCTGTGCTTGTCTTACAGTACTGTGGACTACATGCTTTTGCGTCGACATCACGG 
and 

ACGCAAAAGCATGTAGTCCACAGTACTGTAAGACAAGCACAGCTTTTGCATGCACTATG 
2E5 (1" loop) 

TCAATGCGCATGTACTCTTTGGAACAGATCTCCTTGCCGGTGTTCCGATATGCGTC 

CGGAACACCGGCAAGGAGATCTGTTCCAAAGAGTACATGCGCATTGATCGCAACAGG 
2ES (2 nd loop) 

CAAAAGCTGTGCTTGTACTGTTTGGAATCGATCTCCTTGCTTTTGCGTCGACATGACGG 

ACGCAAAAGCAAGGAGATCGATTCCAAAGAGTACAAGCACAGCTTTTGCATGGACTATG 
PGPns (1 st loop) 

TCAATGCGCATGTACAAACATCGATTCTACTCCTTGCCGGTGTTCCGATATGCGTC 

CGGAACACCGGCAAGGAGTAGAATCGATGTTTGTACATGCGCATTGATCGCAACAGG 
FGFns (2" a loop) 

CAAAAGCTGTGCTTGCACAAACATCGATTCTACTCCTTGTTTTTGCGTCGACATCACGG 

ACGCAAAAACAAGGAGTAGAATCGATGTTTGTGCAAGCACAGCTTTTGCATGCACTATG 
PGFkr il« loop) 

TCARTGCGCATGTACAAAAATCGATCGTACTCCTTGCCGGTGTTCCGATATGCGTC 
and 

CGGAACACCGGCAAGGAGTACGATCGATTTTTGTACATGCGCATTGATCGCAACAGG 
FGF);r :2 ' loon i 

CAAAAGCTGTGCTTGCACAAAAATCGATCGTACTCCTTGTTTTTGCGTCGACATCACGG 
and 

ACGCAAAAACAAGGAGTAGGATCGATTTTTGTGCAAGCACAGCTTTTGCATGCACTATG 



CGGAACACCGGCATGTTTCAGTTGTCTGCAGGTGACATGCGCATTGATCGCAACAGG 
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ACGCAAAAACATGTTTCAGTTGTCTGCAGGTGGCAAGCACAGCTTTTGCATGCACTATG 



(SEQ ID N0i9 
(SEQ ID N0:9 



CGGAACACCGGCAAATC&AT T l- T -TT -T 

FGFqy (2'" 1 loop) 



ACGCAAAAACAAATCGATGGGATGAAGTAGCCGCAAGCACAGCTTTTGCATGCACTATG 



CGGAACACCGGCATTTGTTAGCAAGGATACGTAAACATGCGCATTGATCGCAACAGG 



TTTTGCGTCGACATCACGG 



ACGCAAAAACATTTGTTAGCAAGGATACGTAAGCAAGCACAGCTTTTGCATGCACTATG 



GGAACACCGACATAAAGGATATGGTTGAGTTCTGCAGGCGCATTGATCGCAACAGGGTT 
MM00 7 (2" d loop) 



ACGCAAAAACAAAGTGGGTAAGGTTGTGTTTCTGCAGGCACAGCTTTTGCATGCACTATG 



- ■ i ,1 il , , ' , ;~ACAGCTTTT( 



CGGAACACCGACAGTT/.G-^T^GTTGGAr,GAG':l-CATGCGCATTGATCGC.iA:\-.GG 



CAAAAGCTGTGCCTGCGCGCTTCCTACACACTCTAACTGTTTTTGCGTCGACATCACGG 



ACGCAAiAAACAAGGTGGGCAAAGGACTAAAGGGCAGGCACAGCTITT.iCATGt'ACTATG 



\ca4Caaiaaacaaggtatagaatcgat:jttci"agcagatacagctt a:gcatg-act 



GGAACACCGACAAGGAGTACTGTCAATCCAAGTACAGATGCATTGATCGCAACAGGGTTT 
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-continued 



ACGCAAAAACAAG 1 I I TAC nGCTTTTGCATGCACT (SEQ ID HO:118l 

(SEQ ID HO: 119) 

(SEQ ID NO: 120) 

(SEQ ID HO:121) 
and 

ACGCAAAAACATTCGATCCAATCTGTAGTAACACATGCGCAGCTTTTGCATGCACTATG (SEQ ID NO: 122) 

ACGCAAAAACACATATGAGTCCAAAGTGTTGGGCAGGCACAGCTTTTGCATGCACTATGAC (SEQ ID NO: 124) 

T&rGCATGTTACTACTCTCAATTCCACCAATGTTTTTGCGTCGACATCACGG (SEQ ID NO: 12 5) 

ACGCAAAAACATTGGTGGAATTGAGAGTAGTAACATGCGCAGCTTTTGCATGCACTATG (SEQ ID N0:12«) 
TGPpsl (2 nd loop) 

CAAAAGCTGTCTTTGTCCGGAAAACGATAACGTTTCTCCTTGTAATTGCGTCGACATCACGGACTTCTG (SEQ ID NO: 127 1 

TGTCGACGCAATTACAAGGAGAAACGTTATCGTTTTCCGGACAAAGACAGCTTTTGCATGCACTATGAC (SEQ ID NO: 128) 



The DNA sequences of the oligonucleotide pair used to 
make the cassette to introduce the MM021 peptide into the 
chymotrypsin reactive site loops of the p2JM103-lnk2-BBI 
expression vector are provided below. The cassette was 
ligated into the SphI and Sail restriction sites in the vector. 

MM021 (2 nd loop) 

CAAAAGCTGTGCTTGTAAACACAACGTACGTCTTTTATGTTTTTGCG 

TCGACGCAAAAACATAAAAGACGTACGTTGTGTTTACAAGCACAGC 
TTTTGCATG 

I .ibraries made of cysteine constrained peptides are popu- 
lar reagents (e.g. the commercially available PhD-C7C Phage 
Display Peptide Library Kit: Nl-'H) for selecting peptides that 
bind to substrates of interest. BBI has two cysteine con- 
strained reactive site loops that arc structurally similar to the 
peptide loops displayed in various methods used to select 
peptide binders. So. once a cysteine constrained binding pep- 
tide has been selected, BBI is suitable for use as a scaffold to 
present the peptide in a binding reaction. 

The VegF binding peptide CK37281 (See e.g.. co-pending 
U.S. Provisional Patent Application Ser. No. 60/520,403, 
fded Nov. 13, 2003. incorporated herein by reference) was 
grafted into BBI by replacing the trypsin, chymotrypsin. or 
both reactive site loops, with the CK37281 peptide sequence 
(ACYNLYGWTCXSEQ ID NO:9) by using DNA oligo- 
nucleotide cassettes, lb facilitate the construction, an EcoRI 
site was introduced in the coding region of the BBI gene 
(custom synthesized by ( )peron fechnologies; See, Example 
1 ) between the trypsin and chymotrypsin reactive site loops 



by QiiikChungeie site-directed mutagenesis, using methods 
described by the manufacturer (Stratagene) using the primers 

35 BowBeco-F and BowBeco-R, shown above (0.5 pmol of each 
primer was used in the QuikChangeR reaction; after an initial 
denaturation step of 97° C. for 3 minutes, 1 8 PCR cycles of 
68° C. for 12 minutes, 95° C. for 30 seconds and 55° C. for 
one minute, followed bv a final extension reaction for 15 
minutes at 68° C). 

To replace the trypsin inhibitory peptide loop, two DNA 
oligonucleotides (IBBCK8 1 + and IBBCk8 1 -) were annealed 
and ligated into the Sad and LcoRI restriction sites. Like- 

45 wise, to replace the chymotrypsin inhibitory peptide loop. 
EcoRI and Sail si tes w ere used for insertion of a DNA cassette 
made by annealing the oligonucleotides (2BBck81+ an 
2BBck81-). The CK37281 peptide was grafted into both 
loops by inserting the ( "K 3728 1 peptide in the chymotrypsin 

,, loop (using the oligonucleotides (2BBck81+ an 2BBck81-) 
alter the trypsin loop was first replaced by the CK.3728I 
peptide. BBI with the CK372S1 peptide in the trypsin loop 
(lBBlckHl) was moved into the pJM 1 irMilil expression vec- 
tor as a Sacl-Sphl fragment. BBI with the CK37281 in the 

,5 chymotrv psm loop (2BBIck81), or both loops (12BBIck8 I i. 
was moved into pJMHBBBI as Sacl-Sall fragments. The 
correct sequences were verilied by DNA sequencing (the 
sequenceof 12BBIckH 1 aene is shown in I K r. ^ ). Hie result- 
ing vectors, pJM103-lBBIck81. P JM103-2BBIck81 . or 

.„ p.lM103-12BBlek81. were used to transfonn B. subtilis 
BG3934comK, and the pre >ductii >n of the 1 i( L fusion protei ns 
was determined as in F.xample 1 above. 

The fusion protein running at -44 kDa was detected by 
SDS-PAGE to be the major protein present in the cell free 

:is broth. Although in some cases, there was signilicanl degra- 
dation (up to 50%) of die BBI moiety (especially after >48 h 
of growth in MBD me ' i 1 e<i by the prt i 
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a prominent protein i ind running it M kl )a corresponding 
to the BCE 1 03 catalytic core. In these cases, the titers of the 
BCE 103 cellulase were similar n. thai measured with fusions 
lo the wild-type BB1 (Example 1). hut the activity of the BBI 
(trypsin inhibition with 2BBlckXE or chymotrypsin inhibi- < 
tion with lBBIckSl) was generally about two fold less. 

To reduce the proteolytic degradation of BBI variants dur- 
ing growth (i.e. decrease the amount of BCE103 cellulase 
core present on SI )S-PAGE gels in comparison to the fusion 
protein), a Bacillus subtilis strain with nine protease genes io 
deleted. BG6006 (degU y/, 32. oppA. Aspolii ^01. AaprE, 
AnprE, Aepr. AispA. Abpr. Avpr AwprA. \mpr-ybjl'. AnprB, 
amyli: :xy IK PxvlAca >mK -erm( ' ). was used as an expression 
host, and the grow th temperature ( 35' ( '.) and aeration (200 
rpm) were reduced. With these changes, a major lusion pro- . ^ 
tern band (- 44 kl )a) was observ ed on SI )S-PAG1 gels with an 
insignificant band present al the molecular weight expected 
for the BCE catalytic core protein (-34 kDa). 

In addition to the CK37281 peptide, a number of other 
cysteine constrained peptides were produced when substi- 
tuted into the trypsin and/or chymotrvpsin reactive site loops 2 " 
of BBI fused to the C-terminus of the BCE103 cellulase. 
Specific examples included: 

(1) Peptides designed or selected as complement antago- 
nists, compstatin introduced into the 1" or 2" d reactive 
site loops (See. Sahu et al.. J. Immunol.. 157: 884-891. 25 
[ 1 996] ), C2c ( 1 " loop), C3c ( 1 * loop), C4c ( 1 " loop ) and 

( '5c (1" loop): or peptides selected in a I 'actor B bmdina 
reaction IB, 2B, 4A, 5 A, 6-1 A, 7 A. 8B, 9A, 1 OB, 1 1 - 1 A, 
12B, 13 A, and 15-1 A (all in 2" d loop): 

(2) Peptides designed to bind to the proteases Factor Xa or 30 
stratum corenum chymotrypsin, Xal (2 nd loop) or 
hSCCl (1" loop), respectively; 

(3) Peptides selected in FGF5 binding reactions 1 A6 (l"or 
2 nd loop), 1C2 (1" or 2nd loop), (1" or 2" d loop), 2E5 
(IT 2'"' or both loops). I (dais ( 1 s ' or 2""' loop). FCiFkr 
( 1 * or 2 nd loop ), FGFhl (1 " or 2" d loop ). FGFgy ( 1" or 
2" loop), MM005 ( 1 " or 2'" loop), MM007 ( I", 2 nd or 
bodi loop. i. MM0OO I 2" " loop). MM010 ( 1 ". 2'"' or both 
loops), MM01 7(2'"' loop), EGEpsl (2"'' loop), FGFps2 
(I s , 2" rf orboth loops), and FGFpsB (2" d loop); and 

(4) Peptides selected in l'GF|M bindinu reactions 1A8(2'" J ' 
loop), 1A12 (2 nd loop), 1E1 1 (2 nd loop). TGFpsl (2 nd 
loop), and MM021 (2"'' loop). 

I'he oligonucleotides used to introduce these peptides into H 
either the trypsin ( 1 ' loop ) or chymotrypsin (2'" loop) reac- 
tive site loops, and methods used to graft these peptides into 
BBI, are provided above. In all eases, lusion proteins were 
produced as determined b\ SDS-PAGE gels. However, with 
some substituted peptides, the amount of intact lusion protein 
was increased by reducing the proteolytic degradation as 
described above for the CK37281 substituted peptide. 

Example 3 

Activation of BBI by T hiol Reducing/Oxidizing 5:1 

Aftergrowth, the activ ity of the BBI (by trysin or chymot- 
rvpsin inl lion) is typii illy some 5-20 times lower than 
what w ould be expected from the activity of the BO- 103 "" 
cellulase measured in the cell free supernatants (the two mol- 
ecules should be present at a 1:1 molar ratio in the fusion 
protein). An increase in the activity of BBI (measured by 
either trypsin or chymotrypsin inhibition) in the BCE 103- 
*BI ft) 1 n 1 1 san 11 b 11 uti el I 1 1111 I 11 ! VI) 

t\ pit ally concentrations of I --4 111M added to the M Bl ) grow th 
medium about 14 It after inoculation. The trypsin or chymot- 



rypsin inhibitory activity of BBI in the fusion protein is also 
lower than expected when binding peptides (e.g. VegF bind- 
ing peptide CK37281) replace the chymotrypsin or trypsin 
reacth e site loop, respectrv ely As w ith the wild-type BBI, the 
inhibitory activity can be increased by treatment with bME. 
Unexpectedly, other thiol reducing agents (e.g., cysteine, 
reduced glutathione. 1)1 -dithiothreitol and I ris| 2-earboxy- 
ethyl|phosphine) hod small or negligible effects on the acti- 
vation of BBI during growth in these experiments. Also, 
additions of antioxidants (e.g.. ascorbic acid or DL-a-toco- 
pherol acetate) or other adjuvants to the growth medium (e.g., 
isoleucine, soybean oil, Tween-80), or growth at 30° C. did 
not significantly improve the BCE103:BBI activity ratio. 

Specifically, to determine the BBI activation during 
growth, cultures of B. subtilis BG6006 transformed with 
P 2JM103-E3-2BBIck81 (See. Example 4. below) were 
grown in 40 ml MBD medium in 250 ml shake flasks at 37° C. 
for 13 h. Then, the thiol reducing agents indicated on the 
graph in FIG. 4 were added and cell sup ematants harvested 
after 62 h of grow th. '1 he reagents 2-mercaptoethanol (BME 1. 
cysteine (Cys). reduced glutathione (Glut), and 1)1 -dithio- 
threitol (D i l i were added to the growth medium to the final 
concentrations indicated on the graph provided in FIG. 4. 
Concentrations of 5 mM (3 ME can result in better BCE 103: 
BBI activ itv ratios but tvpicallv result in an o\ erall decrease 
in both BCE 103 and BBI titers (see FIG. 4), at least partially 
due lo the reduction in bacterial growth caused by the added 
reagent. Titers of BCE 103 and 2BBIck81 were determined 
using the assays described in Example 1 . 

BBI activation was also achieved after partial purification 
of the fusion proteins (e.g. BCE-lnk2-2BBIck81, see 
Example 4 below) by Q-Sepharose ion exchange chromatog- 

Tlie fusion protein was purified from cell free broth 
obtained from shake flasks or fermentor runs. The broth was 
filtered, diluted five lo ten fold in water and the pH adjusted to 
pH 7.5-8.0. The diluted sample was loaded onto a column 
packed with Q-Sepharose resin (Gl I leahhcare). The column 
was washed with 50 mM Tris pfl 7.5 and then washed again 
in the same buffer containing 300 mM NaCl. The fusion 
protein was eluted in the same bu ffer with 700 mM NaCl. 

To activate the BBI, the pooled fusion protein fractions 
were diluted ten ink! in \ssa\ Bullerlhen treated with 2 mM 
fiMEandO 2 mMoxidi/cd glutathione (GSSG) with constant 
mixing on a stir plate or rocker plat form lor about 24 hat room 
temperature. The BBI could general!}' be activated to about 
70-100% of the expected trypsin inhibitory activity based 011 
the measured concentration of the IK E 1 03 " cellulase. 
Although the activation method outlined above generally 
yielded the best results, in some cases, in order lo maximize 
the activation of a given sample, screens were performed in 
96-well plates to determine the optimal conditions. Initially, 
the typical conditions screened were the dilution in Assay 
Bull'er (e.g.. ,1 2-50 lold dilution series). |)ME concentration 
(e.g.. series between 0.5-5 mM) and oxidi/ed glutathione 
concentration (e g 0 mM then a series of 1/20 to l/2the(5ME 
concentration). 

The activation of the fusion protein BCE-Ink2-2BBIck81 
is shown in FIG. 5 In this specific example, the fusion protein 
from a Q-Sepharose purification was diluted 1.10 in Dulbec- 
co' s PBS (Mediatech) with 0.005% 1 WEENR-SO. Beta-mer- 
captoethanol was added to a final concentration of 3 mM and 
incubated overnight at room temperature on a rocker. The 
sample was further incubated at room temperature for about 
60 h with vigorous stirring on a maunetic stir plate. The titers 
of the BCE103 and 2BBIck81 (before and after fSME treat- 
ment) were determined by cellulase assav sand trypsin inhibi- 
tory assays, respectively. 

In some embodiments, such as for activating BBI or it 
variants in cell free broth front large volume fermentations, it 
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is desirable to reduce the dilution and |SME concentration in 
the activation reaction. This can be accomplished by using 
higher concentrations of buffer (500 mM Tris pH 8.6), or 
changing to zwitterionic buffers such as CHRS (also CAPS, 
Tricine, TAPS, and other suitable zwitterionic buffers). For " 
example, cell free broth (or fusion protein fractions purified 
by ion exchange chromatography) was diluted 1 : 1 in 375 mM 
CUES pH 8.6 with 0.005% 1 W I • EN i- -X( i then activated with 
i m.V! |)V1I and lOniM Na St ), and incubated with stirring al 
room temperature for about 24 h. BBI or its variants, as 
BCIT03 cellulase fusion proteins, were routinely activated 
by this method to 70-100% of the expected value (based on 
BCE103 cellulase activities). 

1 

Example 4 
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The sequences of the 1 )NA oligonucleotide pair that was 
annealed and ligaled into the Rami I! and Sacl sites of p2JM 
I03-BB1 to fuse BBI to the 2'"' OBI) linker of BCE103 cel- 
lulase are provided below. 



AGCATCGGATTCAAATCAAATTTACACAAATGAAATTGTGTATCATAACG 



This Example describes experiments developed to release 
free BBI or its variants by cleavage of the BCE103-BBI 
fusion proteins. 

'Die sequences of the DNA oligonucleotide pairs that were " 
annealed and ligated into the BamHI and Sacl sites of 
pJM103-BBI to generate potential cleavage sites during cul- 
ture growth between the BCE103 catalytic domain and BBI 
are provided below. , 



The peptide sequences susceptible to ac 
between aspartic acid and proline residues a 



cleavage 
i provided 



G ATC C AGG TGGA.GC TGC T T T AGT TGACG AT G AGAGC T 



Linker 3- (SEQ ID NO:143) (Kemperman et al . , Appl . 

Oligonucleotide primers used to introduce a BssHH site 
into pJM 103BBI by QuikChange® site-directed mutagen- 
esis are provided below. 



CTCATCGTCCCAAGCTGGGTATTCTOCAGGATCAGATGGAGGAGTTGGGT 



Sequences of the DNA oligonucleotides that were 
Mi annealed as a cassette ( Sal 1-1 iindlll ) to introduce I lindlli and 
Xhol sites after the stop codon of BBI. to introduce a Pact site 
after the LAT, and remove the original Ilindlll site are pro- 
vided below. 

55 

(SEQ ID NO : 13b! BCEterm+ ( SEQ ID NO: 146) 

CTCATCGTCATTGCTGTGCTGATGGTCTGATTTCATAAGCAGGCCAAGCT 5' -GACATCACGGACTTCTGCTATGAGCCATGTAAACCAAGCGAGGACGA 

TAGACAGACGCAGGTTCGCCG TSAAGAGAACTAAAAGCTTAACTCGAGGTTAACAGAGGACGGATTTCCTG 



BCEshortproBBI 00 AAGGAAATCCGTTTTTTATTTTTAATTAAG 

P P !SEQ ID HO:137) BCEternH- (SEQ ID HO:147) 
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PCR primers used lo generate the acid labile linkers pro- 
v ided above (i.e.. Linker 1, Linker 2, and Linker 3) inserted 
between the BCE103 catalytic domain and BBI are provided 



(Enteropeptidase cleaveage site) 

(SEQ I 

g.-T(-"Aggt(;:;:.g-.c:;a''gatga''AaA3ac:;atuagagct 



' -CAGCAACATGAGCGCGCAGGCTG 



I SEQ I 
(SEQ I 



lt'R primers used 
vided above (i.e., Linker 
into the 1" CBD linker. 



ihe acid labile linkers pm- 
Linker 2, and Linker 3) inserted 



The sequences of the DNA oligonucleotide pairs that were 
annealed and ligated into the Hand II and Sael sites of 
pJM103-luk2-lBBIck81 to generate potential cleavage sites 
' between the BCE103 catalytic domain and BUI during the 
purification process are provided below. 



BCEgenen2BBI (Genenase 



' -ATCGTCTGGATCCGGGTTGGGATCTGCAACTACCGGTTCTCCTGGAT 



Protein sequence oft he acid labile linkers inserted between 
the BCE103 catalytic domain and BBI are provided below. 
The acid labile linkers are shown in bold type and the 
sequences from the first CBD domain are underlined. 4 



YTGAGAGCT 



Sequences of Ihe oligonucleotide primer pairs used to 
introduce the f and 13 linkers in I inker 2 by QuikC 'hange 
site-directed mutagenensis iSlratagene) are provided below. 



(SEQ I 
(SEQ I 



BCE-WADPN-PDP-BBI 



BGE- I D 



■ MPPSPPTPPSPPGEP -WADPN-PPP-B 



The sequences of the DNA oligonucleotide pairs that w 
annealed and ligated into the BamHi and SacI sites 



i The protein sequence of the E3 linker between the BCE 1 0.3 
catalytic domain and BBI was DNNDPIPEPDDESFN- 
MPIPEP (SEQ ID NO: 174). In tins sequence, the E Linker is 
underlined and the sequence generated by faulty recombina- 
tion in if. coli is shown in bold type. Cleavage by Mpr (or V8 
j protease) can occur after any of the three glutamic acids 
present in the F.3 I inker Thus, the structure was BCETSEQ 
lDNO:174)-BBI. 

The sequences ol the I )NA oligonucleotide pairs thai were 
annealed and ligaied into the Band II and SacI sites of 



I 1 'It i i i i I \ i 1 i en 1 Ml \ HJlckKI rate potential (iciieu I 

BCE103 catalytic domain and BBI during the purification cleavage sites between ihe B< 'El 03 eat.! k tie di imain and BBI 
process are pro\ ided below are provided below. 
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GATCCAGGCGCTGCACACTACAAATCAGACCATCAGCACAGCAATGACGA 



CTCATCGTCATTGCTGTGCTGATGGTCTGATTTGTAGTGTGCAGCGCCTG 



GAT r 



CTCATCGTCTTGAAATTCTACGTAGTGTGCAGCGCCTG 

The protein sequence of a Genenase I sensitive cleavage 
site (also acid and Mpr sensitive! inserted between the 
BCE103 catalytic domain and BBI was DNNDPIPDP- 
GAAHYVEFQ (SHQ ID NO: 179). The Genenase 1 site 
(Gen4 Linker) is in bold type (cleavage occurs between the 
tyrosine and valine) ( NEB) and Linker 2 is underlined. Cleav- 
age by Mpr can also occur after the glutamic acid that follows 
the valine in the Gen4 linker. lhe sequence used herein was 
BCE-SEQ IDNO:179)-BBI 

Cleavage sites in the BCE103-lnk2-2BBIck81 fusion pro- 
tein are indicated below. The C-terminal seven amino acids of 
the BCE103 catalytic domain (underlined), linker 2 sequence 
(bold type), and 2BBIck81 sequences are shown. The acid/ 
heat labile Asp-Pro bonds are indicated with solid headed 
arrows and the Mpr sensitive bonds after glutamic acids are 
indicated with line headed arrows. 



V ▼ ▼ V (SEQ ID NO: 1 

...KIRESASD NNDPIPDPDDESSKPCCDQCACTKSNPPQCRCSDMRL 
HSCHSACKSCACYNDYGKTCFCVDITDFCYEPCKPSEDDKEN 



rowth. dur- 



hi order to isolate free BBI or its variants, the BBI 
needs to be cleaved from the BCE103-BBI fusion pro 
some embodiments, this is accomplished during gro' 
proteases intrinsically produced b\ B. subtil 
native embodiments, this cleavage occurs a 
ing the purification process (e.g. by acid heat or proteolytic 
cleavage). Linkers potentially susceptible to cleavage during 
urowth were designed iSee. above, sub. cbdl . pro. shortpro. 
and cbdD) and cloned into the p.IM 103BBI or p2JM103BBI 
expression vectors as Baml 11-Sacl cassettes. Hie production 
of fusion protein versus B( "1 103 catalytic domain was ana- 
lyzed on SDS-PAGH gels as described in Example 1. 

Little cleavage of the fusion protein was observed for all 
these linkers except with the pro linker, which was nearly 
completely cleaved so that very little intact fusion protein was 
observed on gels, although there was a large band correspond- 
ing to the BCE103 catalytic core. I Infortimately, this cleavage 
during growth resulted in negligible BBI activity measured in 
cell free supematants and no BBI band could be identified on 
SI )S-PAGlv gels. Although it is not intended that the present 
invention be limited to any particular mechanism or theory, it 
is possible that the BBI is particularly sensitive to proteolytic 
degradation in its inactive form. Thus, cleavage during the 
purification process after activation is generally preferred. 

In some embodiments, the bonds betw> 
proline residues are cleaved by heat trc 



known in the art (See e.g., Landon, Meth. Enzymol., 47: 145- 



149 [1977]). The 1" ( BD linker in the BCE103 cellulasehas 
three Asp-Pro dipeptide sequences (See. FIG. 1) with the 
potential to be cleaved by acid heat treatment. However, 
cleavage by acid heat treatment at these sites w as found to be 
s inefficient. Protein sequences that are especially labile to 
acid heat have been described in the literature, three of such 
sequences are WGDPHY (SEQ ID NO: 141 ), DNNDPI (SEQ 
ID NO: 142), and WADPN (SEQ ID NO: 143)(i.e., Linkers 1 , 
2 and 3). 

10 Before these acid labile linkers were introduced into the 
BCE103-BBI expression vector. p.lMKBBBl. a BssHII site 
was introduced by QuikChange® XL (Stratagene) mutagen- 
esis (using the manufacturer's methods: and described in 
Example 2 above, except 8 minute extension and 1 minute 
15 denaturation steps were used) in the aprE signal sequence 
coding region using the oligonucleotide primers BCHbss-F 
and BCEbss-R (provided above). Then, Hindlll and Xhol 
sites were inserted in front of the LAT terminator (after the 
BBI stop codon I and a Pad site w as added a Iter the terminator 
20 (the original Hindlll site after the LAT terminator was 
removed) by inserting an oligonucleotide cassette (BCEt- 
erm+ and BCEterm-: provided above) into the Sail and the 
original Hindlll sites. This new vector was called 
"p2JM103BBI." 
25 The acid labile linker fragments were generated by PCR, 
using forward primer BCE103coreBssI IIIJ'W with each of 
the reverse primers, linker WGDPHY_R, linker DNND- 
PI_RV; or linkerWADPN_RV (the sequences of which are 
all provided above) and p2JM103BBl as the template (see 
30 Example 1 for the PCR protocol ). The PCR fragments of 970 
bp weredigested with BamHl and Pstl. the 1 54 bp fragments 
encoding the acid linker fragments were isolated from an 
agarose gel after electrophoresis, and ligated into the 
p2JM10.3 vector digested with BamHl and Pstl that had also 
is been purified from a gel. The linker sequences in the final 
expression vectors. p2JM1031nkl-BBl. p2JM 1031nk2-BBI 
and p2JM1031nk3-BBI, were verified by DNA sequencing. 

Competent B. subtilis strain BG3934comK or BG6006 
were transformed with the plasmids, colonies selected on 5 
40 ug/ml chloramphenicol LA plates and amplified to 25 ug/ml 
chloramphenicol as described in Example 1 . 

Similarly, the acid labile linkers were inserted into the first 
CBD linker. Specifically. PCR fragments were generated 
using the forward primer BCE103corePstl FW with the 
reverse primers I .plusWGDPHY_RV. LplusDNNDPI_RV. or 
Lplus\A'ADPN_RV (See above, for the sequences) with 
p2JM 1 03BBI as a template. The PCR fragments of about I 50 
bp were digested with BamHl and Pstl. purified and ligated to 
the p2.IM103BBI vector digested with Baml II and Pstl. The 
correct sequences were verified by I )NA sequencing and the 
plasmids p2JM103pllnkl-BBI, p2JM103pllnk2-BBI and 
P 2JM103pllnk3-BBI were used to transform B. subtilis 
strains as described above. 

After growth in MB1) medium, the fusion proteins were 
purified by ion exchange chromatography essentially as 
described above (See. Example 2). Hie fusion protein was 
cleaved by treatment at 55° C. for 16 h in 10% formic acid. 
The BCE103 catalytic domain precipitated during the acid 
treatment and was removed by centrifugation. The free BBI in 
the supernatant was dried overnight on a SpeedVac. The 
sample was suspended in 50 inM ! ris pi i S before loading on 
the SDS-PAGE gel. By analysis of the protein stained SDS- 
PAGE gels, it was observed that acid cleavage was much more 
efficient in the fusion proteins where Linker 2 was inserted 
aspartic acid and 65 between the BCE103 catalytic domain and BBI (BCE- 
it at acidic pH as DNNDPI-PDP-BBI). This linker was found to be cleaved in 



couple of hours at 75° C. in 20 mM glycine pH 2. 
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were tested for 
ere cleaved by 
n temperature. 



In alternative embodiments, the fusion protein was cleaved 
by treatment with a protease during the purification process. 
Linkers were designed with cleavage sites for glutamic acid 
specific proteases (e.g.. Mpror V8 protease), Furin/blisterase. 
Genenase I, and Enteropeptidase (Enterokinase). These link- 
ers were introduced as oligonucleotide cassettes (See above, 
for the sequences) between the B( 'El 03 catalytic core and 
BBI in the expression vector using the BaniHI and Sacl sites 
(See. FKi. 1 ). In the coding region of the original expression 
vector (pJM103BBI), there is a glutamic acid residue in the 
l" CHI) domain and at the third residue in BBI l See. FKi. I ). 
which is contemplated to be susceptible to cleavage by 
glutamic acid specific proteases such as B. subtilis Mpr 
(BsMpr) or V8 protease. However, neither BsMpr nor V8 
protease were found to cleave the BCE-BB1 fusion protein 
very efficiently at these sites. Thus, it w as necessary to design 
other linkers that were susceptible to cleavage by these pro- 

The six acid labile linkers described above 
cleavage by BsMpr. These fusion proteins w 
treatment for 1 6 h with 1 6 ug of BsMpr at root 
After cleavage, the BCE103 catalytic domain was precipi- 
tated by the addition of 10% formic acid and removed by 
centrifugation. The free BBI in the supernatant was dried 
overnight on a SpeedVac. ihe sample was suspended in 50 
mMTris pi I 8, before loading on the SDS-PAGE. Similar to 
the acid cleavage, the BCE-DNNDPI-PDP-BBI (Linker 2) 
fusion protein was much more efficiently cleaved by BsMpr 
than any of the other linkers. Therefore. BBI and its variants 
were found to be effectively released from the BCE- 
DNNDPI-PDP-BBI fusion protein either by acid/heat treat- 
ment or proteolytic digestion with a glutamic acid specific 
protease such as BsMpr. Several other linkers designed for 
cleaved by Mpr (e.g., E, F.3 linker, and fie, provided above) 
were tested but nunc ofthem had any advantage; 
2 (the E3 linker was generated by faulty recombination 
coli after transformation with the Quikt hange R site-directed 
mutagensis reaction designed to construct the H linker] As 
shown above, there are two acid heat labile cleavage sites in 
Linker 2 and three sites sensitive to cleavage by Mpr. 

Linkers designed for cleavage by Furin or Blisterase 
(NEB) (BCHfurinBBI), or Enteropeptidase (Enterokinase. 
NEB) (BCEentBBI) were tested, but none of these sequences 
were cleaved efficiently by the appropriate protease. Four 
linkers were also designed (BCEgenenlBBI, 
B("Egaien2BBI.BCEgenen3BBi. and BCEgenen4BBI)and 
tested for cleavage by ( ienenase I ( NEB i. Efficient cleavage 
of the fusion protein was observed only with the tien4 I inker 
(BCEgenen4BBI). BsMpr was also found to efficiently 
cleave the Gen4 linker. 

After activation of the purified BCE-lnk2-2BBIck81 
fusion protein, cleavage by BsMpr does not go to completion 
as judged by SDS-PAGE gels. However, it was discovered 
that complete cleavage alter activation of BCE-BBI fusion 
proteins with I inker 2 (or the Gen4 linker) can be accom- 
plished by using the Mpr protease isolated from Bacillus 
licheniformis (BIMpr). While it is not intended that the 
present invention be limited to any particular mechanism, 
cleavage after the third amino acid in mature BBI appeared to 
be more sensitive to BIMpr while cleavage after the sixth 
amino acid from the C '-terminus of RBI is more sensitive to 
BsMpr cleavage. 

In some embodiments, after cleavage, the BBI is purified 
away from the BCE103 catalytic domain by selective acid 
precipitation (pH 3 orlower)oftheBC E103 catalytic domain 
as described above, ion exchange chromatography (See, 
Example 5), or by selective binding of BBI on an anhydrot- 
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I ii rose (Sign lumn loaded in 50 m> [ 
washed with 50 mM I ris pH 8.0 with 150 mM NaCl. then 
elutuig bound BBI with 50 mM ulvcinepH 2.2 with 300 mM 

NaCl). 

Example 5 

Binding ofBBIckSl toVegF 

i In this Example, experiments conducted to assess the bind- 
ing of BBIck81 to VegF are described. The BCE103-lnk2- 
2BBIck81 fusion protein was produced in B. subtilis as 
described in Example 2. The fusion protein was purified, and 
the BBI trypsin inhibitory aetiv ity was increased by treatment 
with pME and oxidized glutathione as described in Example 
3. The fusion protein was eleav ed by BsMpr protease (See, 
Example 4) and the free 2BBIck8l' was purified from the 
BCE10.3 catalytic domain by ion exchange chromatography 
using a Q-Scpharose column. 

Briefly, after cleavage, the pi I of the cleaved sample was 
adjusted to 5.5. the sample was then loaded onto the column 
(equilibrated with 25 mM MES pH 5.5). The free 2BBIck81 
was washed through the column using 25 mM sodium acetate 
pH 5.0 while the BCE103 catalytic core remained bound to 
the resin. The 2BBIck81 fraction was concentrated by ultra- 
filtration and analyzed using an electrochemiluminescence 
(ECL) based binding assay (BioVeris). The Anti-VegF anti- 
body (Santa Cruz) and VegF (PcproTech) were labeled with 
the electrochemiluminescent dye and biotin. respectively, as 
described by the manufacturer (BioVeris). All materials were 
in Dulbecco's PBS (Mediatech) supplemented with 0.1% 
TWEEN&-80. An initial dilution series of Anti-VegF anti- 
body ( 1 25, 250 and 500 ng/ml) and VegF ( 1 00. 1 50, 200 and 
250 ng/ml) were tested in the binding assay to determine the 
Linker 35 concentrations of each that would give a robust ECL signal. 

For testing 2BBlek8 1 binding, 50 uL of 500 ng/ml ECL 
labeled Anti-VegF antibody. 50 ul . of 250 ng/ml biotinylated 
VegF and 100 ul 2BBIck81 (series of 12.5. 15. 31.25. 62.5, 
1 25. 250 or 500 ng ml i w ere incubated at room temperature 
for 2 h with shaking. Then. 50 uL of 0.2 mg/ml streptavidin 
coaled beads were added and the reaction was incubated at 
room temperature for 30 minutes. The ECL signal was mea- 
sured using a BioVeris M8. 3X4 \naly/er as described by the 
manufacturer (BioVeris) As shown in FIG. 6. the ECI, signal 
decreased as increasing concentrations of 2BBlck81 dis- 
placed more of the labeled Anti-VegF antibody bound to VegF 
attached to the magnetic beads. 

1 hus. the ( 'K3 7 2S 1 peptide w hen graited onto the chymot- 
rypsm inhibitory loopoi BBI C2BBlek8 1 ) competed with the 
Anti-VegF antibody ior binding to VegF at micromolar con- 
centrations. In fact. 2BBIck81 competed for VegF binding 
better than the synthesized ( 'K3728 1 peptide itself (See. FIG. 
6). The CK37281 peptide inserted into the trypsin inhibitory 
loop, 1 BBIck8 1 . also competed with the Anti-VegF antibody 



l the BioVeris a 
scaffold to present ac 
screening methods. 



Thus. BBI was found tc 
ve binding peptides selected by v< 



Use of Alternative Fusion Partners for the Production 
of2BBIck81 

hi this Example, experiments conducted to evaluate alter- 
native fusion partners are described. The DNA sequence of 
die oligonucleotide primers used to amplify the dsbC gene (¥,. 
coli) from pET-40b(+) are provided below. These primers 



generate a BssHl 1 site ;il the 5' end and a Bamlll at the 3' end 
for cloning into P 2JM103-Gen4-2BBIck8 1 . 



TCCTTTACCGCTGGTC 



The DNA sequences oi the oligonucleotides that were 
annealed together to make a cassette (Alw44I-BamHI) lor 
fusing Hie P. menJociiia cutinase gene to Bli] with Linker 2. 
are provided below. 



CutinaseBBI+ 



TCCTATTCCG 



GATCCGGAATAGGATC 
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of PDI does not seem to significantly improve the titers of 
active 2BBIck8 1 in the fusion protein or obviate the need for 
activation of the BBI molecule. 

In order to increase the reduction potential of the fusion 
protein, which was contemplated to improve the BBI titers 
during growth, DsbO from Escherichia coli was used as a 
fusion partner for 2BBIck81 . The dsbC gene was amplified 
by PCR using Herculase Enhanced DNA polymerase as 
described by the manufacturer (Stratagene) using Dsb- 
CBBI-F and DsbC'BBI-R as primers (sequences shown 
above) and P E MOb(+ i ( Nouiuen ) as a template. The isolated 
PCR fragment was cloned into the vector p2JM103-Gen4- 
2BBIck81 (See. Example 4) as a BssHII-BamHI fragment. 
The correct sequence of the fusion gene w as verified by 1 )NA 
sequencing. In this case, the titers of the DsbC-2BBIck81 
fusion protein were significantly lower than the BCE- 
2BBIckS 1 fusion protein as judged on SI :'Atli gels and 
the titers of the active 2BBIck8 1 measured by trypsin inhibi- 



vell. 



i ^ Other proleii 
which has bee 



re prod 



it high 



•udon 



.-xpress 



aprE 
5,429,950, 



e the 



lisuliide 



aids. 



contemplated that higher titers ot active BBI will beobtained 
using fusion proteins other than the BCE103 cellulase cata- 
lytic domain. For example, in some embodiments, composi- 
tions such as thiol-disulfide oxidoreductases and/or protein 
disulfide isomerases find use as fusion proteins to help pro- 30 
duce correctly folded BBI moieties. In this embodiment, no 
additional activation step is needed under most circum- 
stances. In additional embodiments, other proteins produced 
at high titers in B. suktitis also find use as fusion partners I or 
example, the thermostable protein disulfide isomerase from 35 
the fungus Humicola insolens (hiPDI) has been used as a 
fusion partner to produce the light chain of immunoglobulin 
G (2 disulfides) in Bacillus hrcvis (See. Kajino et al., Appl. 
Env. Microbiol.. 66:638-642 [2000]). 

To determine whether hiPDI could be a better fusion part- 4u 
ner than B( "1 i 1 03 lor the production of BBI, this hiPDI gene 
was synthesized (DNA2.0) and cloned into the expression 
vector, p2.IM103-lnk2-2BBIck81 (See. Example 4) as a 
Bssilll-SacI fragment. In designing the symbolic gene, 
codi ns oecurri i li Irequci i I ly e> pres ed B 4 ; 
siihlilis genes were selected except in cases where restriction 
sites were introduced or deleted. In the final construction, the 
N-terminus of the mature hil'l )I gene was fused to the Apr! ' 
signal sequence and the t '-term mus was fused to a linker with 
an I'.nteropeptidase clea\eage site (Kajino et al., Appl. Env. 50 
Microbiol.. 66:638-642 |2000|). which in turn was fused lo 
2BBIck81 (See. FIG. 7). ffiis expression vector. p2.IM-PDI- 
EK-2BBIck8 1 . was used to transform B. snhiilis BG6006 and 
the production of the fusion protein was determined in MBI) 
medium (as described in f xample 1 ) with or without 2 mM 55 
If ME added 14 li after inoculation. 

As determined by SDS-PAGE gels, the production of the 
PDl-2BBfck81 fusion protein was typically somewhat less 
than the BCE-2BBck81 grown under identical conditions. 
The BBI titers (trypsin inhibition) measured from the PDI- w> 
2BBlck8 1 cell free supernatants were also typically less than 
the BCE-2BBfck81 fusion. As with fusions to BCE103, Hie 
measured activities of BBf when fused to PDI were higher 
when grown in 2 mM (3ME and the BBI activity was increased 
by the addition ol* |iME to the cell free supernatants after -5 
growth when grown in |5ME free medium (as described in 
Example 3 I. Thus, the thiol-disulfide oxidoreductase activity 



ibtilis (See 

herein incorporated by reference). The aprE-cutinase gene 
fusion as an EcoRI-Alw44I fragment (from pAK-15) was 
ligated with an Alw44I-BamHI linker oligonucleotide cas- 
sette (See, sequence above) into the p2JM103-lnk2- 
2BBIck8 1 (See, Example 4) that had been cut with EcoRI and 
BamHI. This cutinase-linker2-2BBIck81 expression vector 
(See, FIG. 8 for the EcoRI-BamHI aprE-cutinase-linker2 
sequence) was used lo transform B. subtilis BG6006 cells and 
the fusion protein was produced in MBD medium as 
described previously for the other fusion proteins (See, 
Example 1). In this case, the cutinase-linker2-2BBIck81 
fusion protein was not the major band observed on SDS- 
PAGI gels and the measured lipase titers (as measured Using 
the methods provided in U.S. Pat. No. 5.429,950) and BBI 
liters were much less (ca. 20 fold) than found with the BCE- 
2BBIck8 1 fusion protein. Also, the BBI titers in the cutinase 
fusion protein were not improved significantly when 3 mM 
fJME was added to the growth medium. Thus, the highest 
titers of active 2BBIck81 was consistently obtained by acti- 
vation of the BCE-2BBIck8 1 fusion protein. Nonetheless, it 
is contemplated that various fusion partners will find use in 
the present invention. 

All patents and publications mentioned in the specification 
are indicative of the levels of those skilled in the art to which 
the invention pertains. Ail patents and publications are herein 
incorporated by reference to the same extent as if each indi- 
\ idual publication w as specifically and individually indicated 
to be incorporated by reference. 

Having described the preferred embodiments of the 
present invention, it will appear to those ordinarily skilled in 
the art that various modifications may be made to the dis- 
closed embodiments, and that such modifications are 
intended to be within the scope of the present invention. 

Those of skill in the art readily appreciate that the present 
invention is well adapted to carry out the objects and obtain 
the ends and advantages mentioned, as well as those inherent 
therein. The compositions and methods described herein are 
representative of preferred embodiments, are exemplary, and 
are not intended as limitations on the scope of the invention. 
It is readily apparent to one skilled in the art that varying 
substitutions and modificatii ins may be made to the invention 
disclosed herein w ithout departing from the scope and spirit 
of the in 
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n illustratively described herein suitably may 
be practiced in the absence of any clement or elements, limi- 
tation or limitations which is not specificallj disclosed herein. 
The terms and expressions which have been employed tire 
used as lerms ol description and not ol limitation, and there is • 
no intention that in the use ol such terms and expressions of 
excluding any equivalents of the features shown and 
described or portions thereof, but it is recogm/ed that various 
modifications are possible within the -.cope of the invention 
claimed. Thus, it should be understood that although the 1 
present invention has been specifically disclosed by preferred 
embodiments and optional features, modification and varia- 



tion of the concents herein disclosed may be resorted to by 
those skilled in the art. and th t sucl modi _ itions and varia- 
tions are considered to be i\ i thin the scope of this invention as 
defined by the appended claims. 

The invention has been described broadly and genetically 
herein. Kach of the narrower species and subgeneric group- 
ings falling within the generic disclosure also form part of the 

tion v\ ith a pros iso or negative limitation remov ing any sub- 
ject matter from the genus, regardless ol w bother or not the 
excised material is specifically recited herein. 



SEQUENCE LISTING 
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-continued 



j Arg. Gly Glu Gin Val Gin Leu L 
55 

-i Trp Tyr Gly Gin Phe Val Asn T 

L Glu Ala Ala lie Asp Leu Gly I 
; 120 
e Leu Ser Asp Asn Asp F 

5 Phe Asp Glu Met Ser Glu Leu T 
150 1 
r Glu lie Ala Asn Glu Pro Asn G 

165 170 
1 I le Lys Pro Tyr Ala Glu Glu V 



lie Tyr Lys Glu Glu 



1 Asp Tyr Ala Leu Asp 
y Thr Ser Ala Ala Thr 



Glu Ala Gin Val Trp He Asp Phe 



Pro Gly Ala Asn Pro Thr Gly Gly Trp T 



Gin Gly Ala 
Gly Asp Gly 



Aon Pro Pro G 



t Cys Arg Cys Ser Asp N 



a Ala Leu Ser Tyr Pro Ala G 
395 

e Cys Tyr Glu Pro Cys Lys F 



Asp Asp Lyo Glu 



ling fusion peptide of SEQ I 



tgggtggact tgtcgctgca gcgatatgcg tctgaattcc tgtcatagtg cctgcaaaag 
ctgcgcatgt tataacctgt acgggtggac ctgtttttgc gtcgacatca cggacttctg 
ctatgagcca tgtaaaccaa gcgaggacga taaagagaac taa 



ial Sequence 

: synthetic fusion peptide 

> SEQUENCE : 4 

Asp Pro Asp Asp Glu Ser Ser Lys Pro Cys Cys Asp Gin Cys Ala C 
Tyr Asn Leu Tyr Gly Trp Thr Cys Arg Cys Ser Asp Met Arg Leu A 




Artificial Sequence 

synthetic fusion p 



31n Leu Lys Lys 



i Lys 



Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu Tyr Glu Glu Ala Ala 

35 40 45 

Thr Thr Leu Lys Glu Lys Asn He Lys Leu Ala Lys Val Asp Cys Thr 

Glu Glu Thr Asp Leu Cys Gin Gin His Gly Val Glu Gly Tyr Pro Thr 
65 70 75 80 

Leu Lys Val Phe firg Gly Leu Asp Asn Val Ser Pro Tyr Lys Gly Gin 

Arg Lys Ala Ala Ala He Thr Ser Tyr Met He 

Ala Val Ser Glu Val Thr Lys Asp Asn Leu Glu 

Asp Lys Ala Val Leu Val Ala Tyr Val Asp Ala 

Ser Glu Val Phe Thr Gin Val Ala Glu Lys Leu Arg Asp Asn Tyr P 



a Pro Ala He Val Leu Tyr Lys Asp Phe Asp 
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-continued 

Gly Ala Thr Pro Leu lie Gly Glu lie Gly Pro Glu Thr Tyr Ser Asp 

Tyr Met Ser Ala Gly lie Pro Leu Ala Tyr lie Phe Ala Glu Thr Ala 

Glu Glu Arg Lys Glu Leu Ser Asp Lys Leu Lys Pro lie Ala Glu Ala 

Gin Arg Gly Val lie Asn Phe Gly Thr lie Asp Ala Lys Ala Phe Gly 

Ala His Ala Gly Asn Leu Asn Leu Lys Thr Asp Lys Phe Pro Ala Phe 

Ala He Gin Glu Val Ala Lys Asn Gin Lys Phe Pro Phe Asp Gin Glu 

290 295 300 

Lys Glu He Thr Phe Glu Ala He Lys Ala Phe Val Asp Asp Phe Val 

Ala Gly Lys He Glu Pro Ser He Lys Ser Glu Pro He Pro Glu Lys 

Gin Glu Gly Pro Val Thr Val Val Val Ala Lys Asn Tyr Asn Glu He 

Val Leu Asp Asp Thr Lys Asp Val Leu He Glu Phe Tyr Ala Pro Trp 

Cys Gly His Cys Lys Ala Leu Ala Pro Lys Tyr Glu Glu Leu Gly Ala 

Leu Tyr Ala Lys Ser Glu Phe Lys Asp Arg Val Val He Ala Lys Val 

Asp Ala Thr Ala Asn Asp Val Pro Asp Glu He Gin Gly Phe Pro Thr 

405 410 415 

He Lys Leu Tyr Pro Ala Gly Ala Lys Gly Gin Pro Val Thr Tyr Ser 

Gly Ser Arg Thr Val Glu Asp Leu He Lys Phe He Ala Glu Asn Gly 

435 440 445 

Lys Tyr Lys Ala Ala He Ser Glu Asp Ala Glu Glu Thr Ser Ser Ala 

Thr Glu Thr Thr Thr Glu Thr Ala Thr Lys Ser Glu Glu Ala Ala Lys 

Glu Thr Ala Thr Glu His Asp Glu Leu Gly Ser Gly Ser Gly Asp Asp 
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Ala Ala Ser Cys Leu Ser Val Cya Ala Thr Val Ala Ala Ala Pro Leu 

Pro Asp Thr Pro Gly Ala Pro Phe Pro Ala Val Ala Asn Phe Asp Arg 

Ser Gly Pro Tyr Thr Thr Ser Ser Gin Ser Glu Gly Pro Ser Cys Arg 

50 55 60 

lie Tyr Arg Pro Arg Asp Leu Gly Gin Gly Gly Val Arg His Pro Val 
65 70 75 80 

lie Leu Trp Gly Asn Gly Thr Gly Ala Gly Pro Ser Thr Tyr Ala Gly 

Thr Ser Asn Ala Gly Thr Gly Arg Glu Met Leu Ala Cys Leu Asp Tyr 

115 120 125 

Leu Val Arg Glu Asn Asp Thr Pro Tyr Gly Thr Tyr Ser Gly Lys Leu 

130 135 140 

Asn Thr Gly Arg Val Gly Thr Ser Gly His Ser Gin Gly Gly Gly Gly 
145 150 155 160 

Ser lie Met Ala Gly Gin Asp Thr Arg Val Arg Thr Thr Ala Pro lie 

Gin Pro Tyr Thr Leu Gly Leu Gly His Asp Ser Ala Ser Gin Arg Arg 

Gin Gin Gly Pro Met Phe Leu Met Ser Gly Gly Gly Asp Thr He Ala 

Ser Gly Gly Ala Tyr Arg Gly Pro Ser Thr Ala Trp Phe Arg Phe Gin 

Leu Met Asp Asp Gin Asp Ala Arg Ala Thr Phe Tyr Gly Ala Gin Cys 

Ser Leu Cys Thr Ser Leu Leu Trp Ser Val Glu Arg Arg Gly Leu Asp 
Asn Aon Aop Pro He Pro Asp 



> SEQUENCE : 9 

Ala Cys Tyr Asn Leu Tyr Gly Trp Thr Cys 




<211> LEI-iGTH So 
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tinued 



> OTHER INFORMATION: synthetic 

> SEQUENCE: 11 

Asn Leu Arg Leu Ser Lys Leu Gly Leu Leu Met Lys Ser Asp His G 

His Ser Asn Asp Asp Glu Ser Ser Lys Pro Cys Cys Asp Gin Cys A 

20 25 30 

Cys Thr Lys Ser Asn Pro Pro Gin Cys Arg Cys Ser Asp Met Arg L 

Asn Ser Cys His Ser Ala Cys Lys Ser Cys He Cys Ala Leu Ser T 

Pro Ala Gin Cys Phe Cys Val Asp He Thr Asp Phe Cys Tyr Glu P 

65 70 7S 8 

Cys Lys Pro Ser Glu Asp Asp Lys Glu Asn His His His His His H 



> OTHER INFORMATI 



gacgacgaga gccccaaacc ctgttgcgac caacgcgcac gcacgaaacc aaaccctcca 
cagcgccggc gccccgatat gcgtctgaat agctgtcata gcgcacgcaa aagccgtacc 
cgcgccccga gccacccagc tcaatgtttt tgcgtcgaca tcacggactt ccgccacgag 



: synthetic 
> SEQUENCE: 13 

Asp Asp Glu Ser Ser Lys Pro Cys Cys Asp Gin Cys Ala Cys Thr 

Ser Asn Pro Pro Gin Cys Arg Cy3 Ser Asp Met Arg Leu Asn Ser 

His Ser Ala Cys Lys Ser Cys He Cys Ala Leu Ser Tyr Pro Ala 

35 40 45 

Cys Phe Cys Val Asp He Thr Asp Phe Cys Tyr Glu Pro Cys Lys 

Ser Glu Asp Asp Lys Glu Asn 




<4 00> SEQUENCE: 15 
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60 
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> LENGTH ■. 6 7 

> FEATURE : 

> OTHER INFORMATION : synthetic oligonucleotide 

> SEQUENCE: 22 

tcgacgcaaa aacaggtcca cccgtacagg ttataacatg cgcagctttt gcaggca 
tgacagg 



: Artificial Sequence 

22 !> OTHIiP T::F0P:4ATT'',): =y::t:^fl~ 

5 00> SEQUENCE: 23 

ctaaaccotg ttgcgatcaa tgcgcatgtg ttgttcagga ctggggtcac caccg 
gctgcagcga tatgcgtctg 

213> ORGANISM: Artificial Sequence 

223> OTHER INFORMATION: synthetic 

i00> SEQUENCE: 24 

aattcagacg catatcgctg cagcgacaac ggtggtgacc ccagtcctga acaac 
cgcattgatc gcaacagggt ttagagct 

213> ORGANISM: Artificial Sequence 
2 23> OTHER INFORMATION: synthetic 
!00> SEQUENCE-. 25 

caaaagctgt atctgcgttg ttcaggactg gggtcaccac cgttgttttt gcg 



> ORGANISM: Artificial Sequence 

> FEATURE : 

> OTHER INFORMATION: synthetic 

> SEQUENCE: 26 

tcgacgcaaa aacaacggtg gtgaccc 



> OFGANTSn Artificial S-quer.c 

> OTHER INFORMATION: synthetic 

> SEQUENCE: 2 7 
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<213> ORGANISM: Artificial Seque 
<22C> FEATURE : 

<223> OTHER INFORMATION: synthet 



<2 20> FEATURE : 

<223> OTHER INFORMATION: synthetic 



ctaaaccctg ttgcgatcaa tgcggttgtg ctcgttctaa cctggacgaa tgtcgctgca 
gcgatatgcg tctg 



> OTHER INFORMATION: synth 

> SEQUENCE: 3 0 



» LENGTH: 74 



> OTHER INFORMATION: synthetic 

> SEQUENCE : 3 1 



:ctg tgtcgctgca 



> SEQ ID N 

> LENGTH: 



> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: synthetic 



;:eq id no 33 

211- EN ,TK "4 

<213> ORGANISM: Artificial Sequence 
<22C> FEATUPE- 

<223> OTHER INFORMATION : synthetic 
<4 00> SEQUENCE: 33 

ctaaaccctg ttgcgatcaa tgccagtgtg gtcgtctgca catgaaaacc tgtcgct 
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igacg catatcgctg cagcgacagg ttttcatgtg cagacgacca caccggcat 



Artificial Sequence 

> OTHER INFORMATION : synthetic 

> SEQUENCE: 3 5 

aattcctgtc atagtgcctg caaaagctgt atctgcgccc gtagtttgcc agct 



<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: synthetic 

<400> SEQUENCE: 3 6 

tcgacgcaaa aacattgagc tggcaaacta cgggcgcaga tacagctttt gcaggcacta 
tgacagg 




<213> ORGANISM: Artificial Sequence 

<220> FEATURE : 

<223> OTHER INFORMATION: synthetic 

<4 00> SEQUENCE: 3 7 

ctaaaccctg ttgcgatcaa tgcaactgta cgtactcaac ccctccacag tgtcgctgca 
gcgatatgcg tctg 



-2 11 LENGTH 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<4 00> SEQUENCE: 3 8 

aattcagacg catatcgctg cagcgacact gtggaggggt tgagtacgta cagttgcatt 
gatcgcaaca gggtttagag ct 
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> SEQUENCE: 4 0 

cgcaaaaaca gccacgagat tttgagcgtt tgcagataca gcttttgcat g 



rtificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 41 

ctgtatctgc tggtataatc aaatgacaac atgtttttgc gtcc 



> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 42 

cgcaaaaaca tgttgtcatt tgattatacc agcagataca gcttttgcat g 



» SEQ ID NO 43 



> FEATURE: 

> OTHER INFORMATION: synthetic oligonucleotide 
■ SEQUENCE: 43 

ctgtatctgc catcaacttg gcccgaattc atgtttttgc gtcgac 



nthetic oligonucleotide 
> SEQUENCE: 44 

cgcaaaaaca tgaattcggg ccaagttgat ggcagataca gcttttgca 



<211> LENGTH : SI 

<212> TYPE : DNA 

<213> DRGAHISM: Artificial Sequence 

<2 20> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 



:t.lt..,L.:'t.jC r.itc-ytygg a : yt at t c ttgtttttgc gtcgac 

> OTHER INFORMATION : synthetic oligonucleotide 

> SEQUENCE: 4 6 

cgcaaaaaca agaatacggt gcccacggat ggcagataca gctttt 
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- cont inued 



70 



30> SEQUENCE : 47 

ctgtatctgc aatcttcatt atcttcaaca gtgtttttgc gtcgacatca c 

Ll> LENGTH: 51 
L2> TYPE : DMA 

L3> ORGANISM: Artificial Sequence 

23> OTHER INFORMATION: synthetic oligonucleotide 
30> SEQUENCE: 48 

cgcaaaaaca ctgttgaaga taatgaagat tgcagataca gcttttgcat g 



» SEQ ID H 
> LENGTH: 



L Sequence 



ctgtatctgc acaccgtctc tttatcgccc gtgtttttgc gtcgacatca c 

> ORGANISM; Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 50 

cgcaaaaaca cgggcgataa agagacggtg tgcagataca gcttttgcat g 



50> SEQUENCE: 51 

ctgtatctgc cttacagatc aatctaaacc gtgtttttgc gtcc 



<210> SEQ ID 



: synthetic oligonucleotide 

> SEQUENCE: 52 

cgcaaaaaca cggtttagat tgatctgtaa ggcagataca gcttttgcat g 

> SEQ ID NO 53 



:4 00> SEQUENCE: 53 
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ctgtatctgc gttacaacat caatgggcat gtgtttttgc gtcgacat 

> SEQ ID NO 54 

> LENGTH : 51 

> TYPE : DNA 

> ORGANISM; Artificial Sequence 

> OTHER INFORMATION : synthetic oligonucleotide 

> SEQUENCE: 54 

cgcaaaaaca catgcccatt gatgttgtaa cgcagataca gcttttgc 

> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 55 

ctgtatctgc cgcgcatcac cgtatgattg gtgtttttgc gtcgacat 

> LENGTH : 51 

> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 



cgcaaaaaca ccaatcatac ggtgatgcgc ggcagataca gcttttgcat g 



rtificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 5 7 

ctgtatctgc tcaacacaaa aaattccgca atgtttttgc c 



> ORGANISM Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

» SEQUENCE: S8 

cgcaaaaaca ttgcggaatt ttttgtgttg agcagataca gcttttgcat g 



> OTHER INFORMATION 

> SEQUENCE: 59 
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ficial Sequence 

: synthetic oligonucleotide 
> SEQUENCE : 60 

cgcaaaaaca tgttgcagag cgaaattgtg tgcagataca gc 



> SEQUENCE: 61 

ctgtatctgc ccggatcatg ttccgcatct ttgtttttgc gtcgac 



> FEATURE: 

> OTHER INFORMATION: synthetic oligonucleotide 



ratgatccg ggcagataca gcttttgcat g 



ctgtatctgc tcaggct 



12> TYPE: DNA 

13> ORGANISM: Artificial Sequence 
20> FEATURE: 

23> OTHER INFORMATION: synthetic oligonucleotid 



> SEQUENCE : 64 

cgcaaaaaca tgtagaaagc ggaaagcctg agcagataca gcttttgcat g 



rtificial Sequence 

> FEATURE: 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE : 6S 

tcaatgcgca tgtgaagaga tctggactat gctttgccgg tgttccgata tgcgtc 



> FEATURE: 

> OTHER INFORMATION: synthetic oligonucleotid 



<400> SEQUENCE: 66 
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-continued 



> ORGANISM : Artificial Sequence 

> FEATURE : 

> 1\y~ ^ H -T- '] it-1-t-i oligonucleotide 

> SEQUENCE: 6 7 

caaaagctgt gcttgtgaag agatctggac tatgctttgc ttttgcgtcg acatcacgg 

> SEQ ID NO 6 8 

> LENGTH : 59 

> TYPE : DNA 

> FEATURE : 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE : 6 8 

acgcaaaagc aaagcatagt ccagatctct tcacaagcac agcttttgca tgcactatg 



■ Artificial Sequence 

> OTHER INFORMATION : synthetic oligonucleotide 

> SEQUENCE: 69 

tcaatgcgca tgttgggccc ttactgtcaa aacatgccgg tgttccgata tgcgtc 

> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 



cggaacaccg gcatgttttg acagtaaggg cccaacatgc gcattgatcg caacagg 



> SEQ ID NO 71 



s JRJANISM - ti i _-,l , 

> FEATURE: 

> OTHER INFORMATION: synthetic oligonucleotide 



caaaagctgt gcttgttggg cccttactgt caaaacatgc ttttgcgtcg a 

<210> SEQ ID NO 72 

<213> ORGANISM: Artificial Sequence 
:22C> FEATURE- 

<223> OTHER INFORMATION: synthetic oligonucleotide 
<4CC> SEQUENCE: -'2 

acgcaaaagc atgttttgac agtaagggcc caacaagcac agcttttgca 
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> SEQUENCE: 73 

ccaacgcgca tgtcttacag tactgtggac tacatgccgg tgtCccgata t 

> SEQ ID NO 74 

> LENGTH; 57 

> ORGANISM: Artificial Sequence 

» OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 7 4 

cggaacaccg gcatgtagtc cacagtactg taagacatgc gcattgatcg c 

> OTHER INFORMATION: synthetic oligonucleotide 



caaaagctgt gcttgtctta cagtactgtg gactacatgc ttttgcgtcg acatcacgg 

<211> LENGTH : 59 
<212> TYPE : DNA 

e213> ORGANISM : Artificial Sequence 
<220> FEATURE : 

<223> OTHER INFORMATION: synthetic oligonucleotide 
<400> SEQUENCE: 76 

acgcaaaagc atgtagtcca cagtactgta agacaagcac agcttttgca tgcactatg 



<210s SEQ ID NO 77 

<211> LENGTH: 56 

<212> TYPE ; DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<400> SEQUENCE: 77 

tcaatgcgca tgtactcttt ggaacagatc tccttgccgg tgttccgata tgcgtc 



<2 12, TYPE. :ka 

<213> ORGANISM: Artificial Sequence 

o:j> OTHEf INFORMATION- oynthetic oligonucleotide 

<4 00> SEQUENCE: 78 

cggaacaccg gcaaggagat ctgttccaaa gagtacatgc gcattgatcg caacagg 



<2U» LENGTH 59 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<223> OTHER INFORMATION: synthetic oligonucleotide 




<212> TYPE : DNA 

<223> OTHER INFORMATION: synthetic oligonucleotide 
<4 00> SEQUENCE: 82 

cggaacaccg gcaaggagta gaatcgatgt ttgtacatgc gcattgatcg caacagg 57 



<211> LENGTH: 59 

<213> ORGANISM: Artificial Sequence 
<22 0> FEATURE : 

<223> OTHER INFORMATION: synthetic oligonucleotide 
<4 00> SEQUENCE: 83 

caaaagctgt gcttgcacaa acatcgattc tactccttgt ttttgcgtcg acatcacgg 59 



<210> SEQ ID NO 84 

<211> LENGTH: 59 

<2 12> TYPE: DNA 

<213> ORGANISM : Artificial Sequence 

<220> FEATURE : 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<400> SEQUENCE: 84 

acgcaaaaac aaggagtaga atcgatgttt gtgcaagcac agcttttgca tgcactatg 59 



<211> LENGTH: 56 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<4 00> SEQUENCE: 85 

tcaatgcgca tgtacaaaaa tcgatcgtac tccttgccgg tgttccgata tgcgtc 56 
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> SEQUENCE : 8 6 

cggaacaccg gcaaggagta cgatcgattt ttgtacatgc gcattgatcg caacag 



rtificial Sequer. 



> SEQUENCE: 8 7 

caaaagctgt gcttgcacaa aaatcgatcg tactccttgt ttttgcgtcg acatcacgg 



: Artificial Sequence 
<220> FEATURE : 

<4 00> SEQUENCE : 88 

acgcaaaaac aaggagtacg atcgattttt gtgcaagcac agcttttgca 



<211> E'l iH S< 
<212> TYPE : DNA 

<213> ORGANISM: Artificial Sequence 

<223> OTHER INFORMATION: synthetic oligonucleotide 
<400> SEQUENCE: 89 

tcaatgcgca tgtcacctgc agacaactga aacatgccgg tgttccgata tgcgtc 56 



<213> ORGANISM: Artificial Sequence 

<4 00> SEQUENCE: 90 

cggaacaccg gcatgtttca gttgtctgca ggtgacatgc 




<2 20> FEATURE: 

<223> OTHER INFORMATION: 0ynthet ic oligonucleotide 
<4 00> SEQUENCE: 91 

caaaagctgt gcttgccacc tgcagacaac tgaaacatgt ttttgcgtcg acatcacgg 59 




<223> OTHER INFORMATION: synthetic oligonucleotide 
<4 :» ;;eo:jeii 'e r. 



acgcaaaaac atgtttcagt tgtctgcagq tggcaagcac agcttttgca tgcactatg 59 
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:ificial Sequence 

: synthetic oligonucleotide 

> SEQUENCE: 93 

tcaatgcgca tgtggctact tcatcccatc gatttgccgg tgttccgata tgcgtc 

> SEQ ID NO 94 

> LENGTH : 57 

> TYPE : DNA 

> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 94 

cggaacaccg gcaaatcgat gggatgaagt agccacatgc gcattgatcg caacagg 



> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 95 

caaaagctgt gcttgcggct acttcatccc atcgatttgt ttttgcgtcc 



<210> SEQ ID NO 96 

<211> LENGTH : 59 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 20> FEATURE : 

<223> OTHER INFORMATION : synthetic oligonucleotide 

<4 00> SEQUENCE: 96 




<210> SEQ ID NO 98 

<211> LENGTH: 57 

<212> TYPE: DNA 

<223> OTHER INFORMATION: synthetic oligonucleotide 



<4 00> SEQUENCE: 98 

cggaacaccg gcatttgtta gcaaggatac gtaaacatgc gcattgatcg caacagg 
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: synthetic oligonucleotide 
<4 00> SEQUENCE: 106 

acgcaaaaac agttaagagt aggtgttaac aggcaggcac agcttttgca tgcactatg 

<210> SEO ID NO 107 

<211> LENGTH : 57 

<212> TYPE: DMA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATtJRE: 

<223> OTHER INFORMATION : synthetic oligonucleotide 

<400> SEQUENCE: 107 

tcaatgcgca tgcgctcttc caactcattc taactgtcgg tgttccgata tgcgtct 



> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 108 

cggaacaccg acagttagaa tgagttggaa gagcgcatgc gcattgatcg caacagg 

> LENGTH : 59 

> FEATURE : 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE; 109 

caaaagctgt gcctgcgcgc ttcctacaca ctctaactgt ttttgcgtcg acatcacgg 



:ificial Sequence 

: synthetic oligonucleotide 
. SEQUENCE: 110 

acgcaaaaac agttagagtg tgtaggaagc gcgcaggcac agcttttgca tgcactatg 



> SEQUENCE: 111 

caaaagctgt gcctgccctt taggcctttg cccaccttgt ttttgcgtcg acatcac 

> [Ell TI • 

> ORGANISM: Artificial Sequence 

» OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 112 

acgcaaaaac aaggtgggca aaggcctaaa gggcaggcac agcttttgca tgcacta 
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-continued 



<4 00> SEQUENCE: 119 

aagctgtatc tgtacatgga tcgattggac accttgtttt tgcgtcgaca tcacgg 



<211> LENGTH: 56 
<212> TYPE : DNA 




<223> OTHER INFORMATION : synthetic oligonucleotide 
<400> SEQUENCE; 120 



<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 20> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 
<400> SEQUENCE: 121 

caaaagctgo gcatgtgtta ctacagattg gatcgaatgt ttttgcgtcg acatcacgg 59 



<211> LENGTH: 59 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<22 0> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<400> SEQUENCE: 122 

acgcaaaaac attcgatcca atctgtagta acacatgcgc agcttttgca tgcactatg 



<210> SEQ ID NO 123 

<2 11> LENGTH: 61 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE : 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<400> SEQUENCE: 123 

caaaagctgt gcctgcccaa cactttggac tcatatgtgt ttttgcgtcg acatcacgga 



> ORGANISM: Artificial Sequence 

> FEATURE: 

> OTHER INFORMATION: synthetic oligonucl 



:aaagtgtt gggcaggca 




<4 00> SEQUENCE: 12 5 
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i INFORMATION: synthetic oligonucleotide 



acgcaaaaac attggtggaa ttgagagtag taacatgcgc agcttttgca 

> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 12 7 



> TYPE : DNA 

> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE : 128 

tgtcgacgca attacaagga gaaacgttat cgttttccgg acaaagacag ottttgcatg 



» SEQ ID K 
> LENGTH: 



: Artificial Sequence 

> FEATURE: 

> OTHER INFORMATION : synthetic oligonu 



caaaagctgt gcttgtaaac acaa 



<213> ORGANISM: Artificial Sequence 
<2 20> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 
<400> SEQUENCE: 130 

tcgacgcaaa aacataaaag acgtacgttg tgtttacaag cacagctttt gcat 



: nynthetic oligonucl 
> SEQUENCE: 131 
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, OTHER INFORMATION : synthetic oligonucl 
» SEQUENCE: 13 2 

ctcatcgtca actaaagcag ctccacctg 



lficial Sequence 

20> FEATURE: 

;-.!> VI HE}-. INFORMATION: sy^rrtk oligonucleotide 



il Sequence 

synthetic oligonucleotide 



tccagg atcagatgga ggagttgggt caggttca 



> LENGTH : 79 

> TYPE : DNA 

> ORGANISM: Artificial Sequence 

> FEATURE : 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 13 5 

gatccggcga acctgcgtct gtctaagctt ggcctgctta tgaaatcaga ccatcagcac 




<210> SEQ ID NO 137 




<lli> ORGANISM Artificial Sequence 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<4 0i> ceq::ei;ce. 13^ 

gatccaaaat cagaccatca gcacagcaat gacgatgaga get 
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<400> SEQUENCE : 138 

ctcatcgtca ttgctgtgct gatggtctga ttttg 35 



<210> SEQ ID NO 139 

<211> LENGTH: 205 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<400> SEQUENCE: 139 

gatccaggag aaccggaccc aacgccccca agtgatccag gagagtatcc agcatgggat 

tcaaatcaaa tttacacaaa tgaaattgtg tatcataacg gtcagttatg gcaagcgaaa 

tggtggacac aaaatcaaga gccaggtgac ccatacggtc cgtgggaacc actcaaatct 

gacccagatt cagacgatga gagct 



<211> LENGTH: 197 

<213> ORGANISM ; Artificial Sequence 

<2 20> FEATURE: 

<223> OTHER INFORMATION : synthetic oligonucleotide 



ctcatcgtct gaatctgggt cagatttgag tggttcccac ggaccgtatg ggt^ 

=tcttgattt tgtgtccacc atttcgcttg ccataactga ccgttatgat aca< 

atttgtgtaa atttgatttg aatcccatgc tggatactct cctggat 

cgggtccggt tctcctg 



<211> LENGTH: 6 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<223> OTHER INFORMATION: linker peptide 
<400> SEQUENCE: 141 

Trp Gly Asp Pro His Tyr 



<210> SEQ ID NO 142 

RGANISM: Artificial Sequence 
<2 20> FEATURE : 

<223> OTHER INFORMATION: linker peptide 
<400> SEQUENCE: 142 








<2 10> SEQ ID HO 149 
<211> LENGTH- S» 
<212> TYPE: DNA 

<213> ORGANISM- Artificial Sequence 
<22„> FEATURE 

-22?> 2THER I"F3KM/- TI Of] . :it h e t ic oligonucleotide 
<4 00> SEQUENCE 14 9 

atcgtctgga tccggatagt gggggtctcc ccaagatgct gattctctta ttttttccc 




<400> SEQUENCE : ISO 
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:atgg gatcattgtt gtcagatgct gat 



> TYPE: DNA 

1 FI I ~ tic oligonucleotide 

> SEQUENCE : 151 

atcgtctgga tccgggttgg gatctgcaac tacagatgct gat 



athetic oligonucleotide 
> SEQUENCE : 152 

gcataaggat gagtcatctg cagcg 



> SEQUENCE : 153 

atcgtctgga tccggatagt gggggtctcc ccacggttct cctggat 



> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 154 

atcgtctgga tccggtatgg gatcattgtt gtccggttct cctggat 



> ORGANISM: Artificial Sequence 

> FEATURE: 

> OTHER INFORMATION : synthetic oligonucleotic 



atcgtctgga tccgggttgg gatctgcaac taccggttct cctggatcag atggcg 

> ORGANISM: Artificial Sequence 

> OTHER INFORMATION: linker peptide 

> SEQUENCE: 156 

Trp Gly Asp Pro His Tyr 
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tal Sequence 
: linker peptide 
» SEQUENCE: 1S7 

Asp Asn Asn Asp Pro lie 




» ORGANISM: Artificial Sequence 

> OTHER INFORMATION : linker peptide 

> Bv ElN:'E ■ 1 ' 

He Pro Pro Ser Asp Pro Thr Pro Pro Ser Asp Pro Gly Glu Pro Asp 

20 



<210> SEQ ID NO 161 

<211> LENGTH: 21 

<212> TYPE: PRT 

<223> OTHER INFORMATION: linker peptide 

<4 00> SEQUENCE : 161 

He Pro Pro Ser Asp Pro Thr Pro Pro Ser Asp Pro Gly Glu Pro Val 
1 S 10 15 

Val Ala Asp Pro Asn 



<210> SEQ ID NO 162 

<211> LENGTH: 40 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE : 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<4GG> SEQUENCE: 162 

gatccaggtg gagacgacga tgacaaagac gatgagagct 




<4 00> SEQUENCE: 175 
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al Sequence 
synthetic oligonucleotide 



ctcatcgtca ttgctgtgct gatggtctga tttgtagtgt gcagcgcctg 

> ORGANISM : Artificial Sequence 

> OTHER INFORMATION: synthetic oligonucleotide 

> SEQUENCE: 177 

gatccaggcg ctgcacacta cgtagaattt caagacgatg agagct 



<212> TYPE : DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE : 

<223> OTHER INFORMATION: synthetic oligonucleotide 



cgtagtgtgc agcgcctg 



> OTHER INFORMATION: cleavage site 

> SEQUENCE: 179 

Asp Asn Asn Asp Pro lie Pro Asp P 



<212> TYPE : 



ial Sequence 

: synthetic fusion peptide 
> SEQUENCE: 180 

Asp Asp Glu Ser Ser Lys Pro Cys Cys Asp Gin Cys Ala Cys Thr Lys 

Ser Asn Pro Pro Gin Cys Arg Cys Ser Asp Met Arg Leu Asn Ser Cys 

His Ser Ala Cys Lys Ser Cys Ala Cys Tvr Asn Le'i Tvr 01 V Trp Thr 

50 55 60 

Cys Phe Cys Val Asp Tie Thr Asp Phe Cys Tyr Glu Pro Cys Lys Pro 

Ser Glu Asp Asp Lys Glu Asn 
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al Sequence 
synthetic oligonucleotide primer 



aacatgagcg cgcaggctga tgacgcggca attcaacaaa cgttag 
U0> SEQ ID NO 182 

213> ORGANISM: Artificial Sequence 

223> OTHER INFORMATION : aynthetic oligonucleotide primer 
100> SEQUENCE: 182 

ac cactagttga tcctttaccg 



<210> SEQ ID NO 183 

<211> LENGTH: 60 

<213> ORGANISM; Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<400> SEQUENCE: 183 

tgcacttctc tgctttggtc tgttgaacgc agaggtcttg acaacaatga tcctattccg 



<210> SEQ ID NO 184 

<211> LENGTH: SO 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: synthetic oligonucleotide 

<4 00> SEQUENCE: 184 

gatccggaat aggatcattg ttgtcaagac ctctgcgttc aacagaccaa agcagagaac 



We claim: 

1. A method for producing ;i Vl-X.il" binding peptide in a 
protease inhibitor scaffold in a bacterial cell comprising. 

introducing a UNA construct into a bacterial cell, wherein 
said DNA construct comprises a heterologous DNA 
sequence encoding a protease inhibitor, said inhibitor 
being a liowman-Birk Inhibitor iBBI i comprising SEQ 
II ) NO: 1 3. wherein the trypsin and .-rein molrpsin loop 
of SEQ ID NO: 13 is replaced with a VEGF binding jQ 
peptide consisting of SEQ ID NO: 9: 

culturing said bacterial cell under suitable culture condi- 
tions to allow expression of the heterologous DNA 
sequence, and 

producing said VEGF binding peptide in a protease inhibi- 5S 
tor scaffold. 

2. The method of claim 1. further comprising recovering 
said VEGF binding peptide in a protease inhibitor scaffold. 

3. The method of claim 2. further comprising the step of 
activating said VEtiE binding peptide m a protease inhibitor 
scaffold. 

4. The method of claim 3, wherein said activating is accom- 
plished by exposing said VEGF binding peptide in a protease 
inhibitor scaffold to at least one reagent selected from the 
group consisting of compositions that reduce disulfide bonds, 65 
compositions that oxidize disulfide bonds, and compositions 
that alter the redox potential. 



5. The method of claim 1. wherein said bacterial cell is a 
member of the genus Bacillus. 

6 The method of claim 1. further comprising introducing 
into the bacterial cell a second nucleic acid sequence encod- 
ing a thiol disulfide oxidoreductase or chaperone. 

7. ITie method of claim 1 . wherein said VEGF binding 
peptide in a protease inhibitor scaffold is expressed as a 

8. The method of claim 7, wherein said fusion protein 
further comprises a cellulase catalytic domain, a cleavage 
site, and said VEGF binding peptide in a protease inhibitor 
scaffold. 

9. The method of claim 7, wherein said fusion protein is 
processed by a protease or acid heat treatment to liberate said 
VEGF binding peptide in a protease inhibitor scaffold. 

10. The method of claim 7, wherein the fusion protein 
further comprises at least one linker sequence. 

11. The method of claim 10, wherein said linker sequence 
is selected from the group consisting oi SEQ ID NOST41- 
143. 

12. An isolated polynucleotide encoding a Bowman-Birk 
Inhibitor (BBI) comprising SEQ ID NO: 13. wherein the 
trypsin and/or chymotrpsin loop of SEQ ID NO: 13 is 
replaced with SEQ ID NO: 9. 

13. An expression vector comprising a polynucleotide 
sequence, wherein said polynucleotide sequence encodes a 
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Bowman-Birk inhibitor comprising SEQ ID NO: 13, wherein 
the trypsin and/or chymotrypsin loop of SEQ ID NO: 13 is 
replaced with SEQ ID NO: 9. 

14. A host cell transformed with the vector of claim 13 

15. The host cell of claim 14. wherein said host cell is a 5 
Bacillus species cell. 

16. A method for producing a VEGF binding peptide in a 
protease inhibitor scaffold in a bacterial cell comprising, 

introducing a DNA construct into a bacterial cell, wherein 
said DNA construct comprises a heterologous DNA to 
sequence encoding a protease inhibitor, said inhibitor 
being a Bowman-Birk Inhibitor (BB1) comprising SHQ 
ID NO: 13, wherein the trypsin and/or chymotrpsin loop 
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of SEQ ID NO: 13 is replaced with a \ EC if binding 
peptide consisting of SEQ ID NO: 9. and wherein SEQ 
ID NO: 1 3 is truncated at the N- and/or C -terminus, w ith 
as many as seven N-terminal residues up to the first 
cysteine (Cys8 ) and/or as many as nine C'-lerminal resi- 
dues up to the last cysteine (Cys62) removed; 
cultunnu said bacterial cell under suitable culture condi- 
tions to allow expression of the heterologous DNA 
sequence, and 

producing said VEGF binding peptide in a protease inhibi- 
tor scaffold. 



